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SECRETED PROTEINS AND POLYNUCLEOTIDES ENCODING THEM 

5 This application is a continuation-in-part of application Ser. No. 60/XXX,XXX 

(converted to a provisional application from non-provisional application Ser. No. 
08/825,145), filed March 25, 1997, which is incorporated by reference herein. 

HELD OF THE INVENTION 
1 0 The present invention provides novel polynucleotides and proteins encoded by 

such polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 

BACKGROUND OF THE INVENTION 

15 Technology aimed at the discovery of protein factors (including e.g., cytokines, 

such as lymphokines, interferons, CSFs and interleukins) has matured rapidly over the 
past decade. The now routine hybridization cloning and expression cloning techniques 
clone novel polynucleotides "directly "_in the sense that they, rely on information directly 
related to the discovered protein (i.e., partial DNA/amino acid sequence of the protein 

2 0 in the case of hybridization cloning; activity of the protein in the case of expression 
cloning). More recent "indirect" cloning techniques such as signal sequence cloning, which 
isolates DNA sequences based on the presence of a now well-recognized secretory leader 
sequence motif, as well as various PCR-based or low stringency hybridization cloning 
techniques, have advanced the state of the art by making available large numbers of 

2 5 DNA/amino acid sequences for proteins that are known to have biological activity by 

virtue of their _secreted nature in the case of leader sequence cloning, or by virtue of the 

cell or tissue source in the case of PCR-based techniques. It is to these proteins and the 
polynucleotides encoding them that the present invention is directed. 
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SUMMARY OF THE INVENTION 
In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

5 NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 54 to nucleotide 737; 

. (c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 188 to nucleotide 671; 
10 (d) a polynucleotide comprising the nucleotide sequence of the full- 

length protein coding sequeince of clone bfl71_6 deposited' under accession 
number ATCC 98371; 

(e) ^ a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bfl71„6 deposited under accession number ATCC 98371; 
15 (f) a polynucleotide comprising the nucleotide sequence of a mature 

protein coding sequence of clone bfl71_6 deposited under accession number 
ATCC 98371; " • . 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone bfl71_6 deposited under accession number ATCC 98371; 
2 0 (h) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:2; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 109 to amino acid 118 of 

2 5 SEQipNO:2; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

3 0 (1) a polynucleotide capable of hybridizing under stringent conditions 



to any one of the polynucleotides specified in (a)-(i). 



Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:l from nucleotide 54 to nucleotide 737; the nucleotide sequence of SEQ ID NO:l from 
nucleotide 188 to nucleotide 671; the nucleotide sequence of the full-length protein coding 



2 
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sequence of clone bfl71_6 deposited under accession number ATCC 98371; or the 
nucleotide sequence of a mature protein coding sequence of clone bfl71_6 deposited 
under accession number ATCC 98371. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
5 of done bfl71_6 deposited under accession number ATCC 98371. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:2 from amino acid 46 to amino acid 
206. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
10 IDNO:l^ 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 
15 (b) the amino acid sequence of SEQ ID NO:2 from amino acid 46 to 

amino acid 206; 

(c) fragments of the amino acid sequence of SEQ ID NO:2 comprising 
the amino acid sequence from amino acid 109 to amino acid 118 of SEQ ID NO:2; 
and 

2 0 (d) the amino acid sequence encoded by the cDNA insert of clone 

bfl71_6 deposited under accession number ATCC 98371; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:2 or the amino acid sequence 
of SEQ ID NO:2 from amino acid 46 to amino acid 206. 

2 5 In one embodiment, the present invention provides a composition comprising an 

isolated polynucleotide s^elected ffbifTthe group consisting of: ~ - - - 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 

3 0 NO:3 from nucleotide 135 to nucleotide 1169; 

(c) a polynucleatide-Comprising_the,nucleo.tide sejqueno^^ 



NO:3 from nucleotide 1 to nucleotide 875; 



3 
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.-(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone cklSl_7 deposited under accession 
number ATCC 98371; 

(e) a polynucleotide encoding the full-length protein encoded by the 
5 . cDNA insert of clone ckl81_7 deposited under accession number ATCC 98371; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone ckl81_7 deposited under accession number 
ATCC 98371; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
10 insert of clone ckl81_7 deposited under accession number ATCC 98371; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:4; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:4 having biological activity, the fragment 

15 comprising the amino acid sequence from amino acid 167 to amino acid 176 of 

SEQ ID NO:4; 

(j) ^ a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide v^'hich encodes a species homologue of the protein 

2 0 of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
■NO:3 from nucleotide, 135 to nucleotide 1169; the nucleotide sequence of SEQ ID NO:3 
25 from nucleotide 1 to nucleotide 875; the nucleotide sequence ojf the full-length protein 
coding sequence of clone ckl81_7 deposited under accession number ATCC 98371; or the" 
nucleotide sequence of a mature protein coding sequence of clone- ckl81__7 deposited 
under accession number ATCC 98371. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert ^ 

3 0 of clone ckl81_7 deposited under accession number ATCC 98371. In yet other preferred 

em&ddimentJ,~the~pfese^^ 

comprising the amino add sequence of SEQ ID NO:4 from amino acid 1 to amino acid 247. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:3. 



4 
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In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; 
5 (b) the amino acid sequence of SEQ ID NO:4- from amino acid 1 to 

amino acid 247; 

(c) fragments of the amino acid sequence of SEQ ID NO:4 comprising 
the amino acid sequence from amino acid 167 to amino acid 176 of SEQ ID NO:4; 
and 

10 (d) the amino acid sequence encoded by the cDNA insert of clone 

ckl81_7 deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. Preferably such 

protein comprises the amino acid sequence of SEQ ID NO:4 or the amino acid sequence 

of SEQ ID NO:4 from amino acid 1 to amino acid 247. 
15 In one embodiment, the present invention provides a composition comprising an 

isolated polynucleotide selected from. the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
2 0 NO:5 from nucleotide' 882 to nucleotide 1106; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 1050 to nucleotide 1106; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 1028 to nucleotide 1395; 

2 5 (e) a polynucleotide comprising the nucleotide sequence of the full- 

length protein coding sequence of clone co736_3 deposited under accession 
number ATCC 98371; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone co736_3 deposited under accession number ATCC 98371; 

3-0 (g) a-polynucleotide-comprising-the-nuGleotide-sequence-of-a,mature 

protein coding sequence of clone co736_3 deposited under accession number 
ATCC 98371; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone co736_3 deposited under accession number ATCC 98371; 
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(i) a polynucleotide encoding a protein conaprising the amino acid 
sequence of SEQ ID NO:6; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:6 having biological activity, the fragment 
5 comprising the amino acid sequence from amino acid 32 to amino acid 41 of SEQ 

ID NO:6; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h). above; 

(1) a polynucleotide which encodes a species homologue of the protein 
10 of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing iinder stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 882 to nucleotide 1106; the nucleotide sequence of SEQ ID NO:5 
15 from nucleotide 1050 to nucleotide 1106; the nucleotide sequence of SEQ ID NO:5 from 
nucleotide 1028 to nucleotide 1395; the nucleotide sequence of the full-length protein 
coding sequence of clone co736_3. deposited under acce'ssion number ATCC 98371; or the 
nucleotide sequence of a mature protein coding sequence of clone co736_3 deposited 
under accession number ATCC 98371. In other preferred embodiments, the 
2 0 polynucleotide encodes the full-length or a mature protein encoded by the cDN A insert 
of clone co736_3 deposited under accession number ATCC 98371. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
- IDNO:5. 

In other embodiments, the present invention provides a composition comprising 

2 5 a protein, wherein said protein comprises an amino acid sequence selected from the group 

consisting of: 

(a) the amino add sequence of SEQ ID NO:6; 

(b) fragments of the amino acid sequence of SEQ ID NO:6 comprising 
the amino acid sequence ifrom amino add 32 to amirto add 41 of SEQ ID NO: 6; and 

3 0 (c) the amino add sequence encoded by the cDNA insert of clone 



co736_3 d^eposited undeF ac'cessiofi number ATCC 98371^ 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:6. 



6 
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In one embodiment, the present invention provides a composition comprismg an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of 5EQ ID 

NO:7; 

5 (b) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7 from nucleotide 2283 to nucleotide 2858; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 1164 to nucleotide 1433; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
10 length protein coding sequence of clone dm26_2 deposited under accession 

number ATCC 98371; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone dm26_2 deposited under accession number ATCC 98371; 

(f) - a polynucleotide comprising the nucleotide sequence of a mature 
15 protein coding sequence of clone dm26_2 deposited under accession number 

ATCC 98371; 

(g) s polynucleohde encoding a mature. protein encoded by the cDNA 
insert of clone dm26_2-deposited under accession number ATCC 98371; 

(h) a polynucleotide encoding a protein comprising the amino acid 

2 0 sequence of SEQ ID NO:8; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:8 having biological activity, the fragment 
comprising the amino acid sequence from.amino acid 91 to amino acid 100 of SEQ 
IDNO:8; 

25 (j) a polynucleotide which is an allelic variant of a polynucleotide of 

(aHg) above; ~ " " — ^ " "7 

(k) a polynucleotide which encodes a species homologue of the protein 

of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
30 to any one of the polynucleotides specifiedin (a)-(i). 

"Preferablyrsuch-polynueleotide-Gomprises-the,nucleo_ticle_segu 



NO:7 from nucleotide 2283 to nucleotide 2858; ti^e nucleotide sequence of SEQ ID NO:7 
from nucleotide 1164 to nucleotide 1433; the nucleotide sequence of the full-length protein 
coding sequence of clone dm26_2 deposited under accession number ATCC 98371; or the 
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nucleotide sequence of a mature protein coding sequence of clone dm26_2 deposited 
under accession number ATCC 98371. In other preferred embodiments, the 
polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone dm26_2 deposited under accession number ATCC 98371. 
5 Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 

lb NO:7. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

1 0 (a) the amino acid sequence of SEQ ID NO:8; 

(b) fragments of the amino acid sequence of SEQ. ID NO:8 comprising 
the amino acid sequence from amino acid 91 to amino acid 100 of SEQ ID NO:8; 
and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
15 . dm26_2 deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID jMO:8. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
2 0 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:9; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from' nucleotide 168 to nucleotide 683; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

2 5 NO:9 from.nucleotide„318 to nucleotide 683; - 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone eq229_3 deposited vmder accession 
niimber ATCC 98371; 

(e) a polynucleotide encoding the full-length protein encoded by the 

3 0 cDNA insert of clone eq229_3 deposited under accession number ATCC 98371; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone eq229_3 deposited under accession number 
ATCC 98371; 
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(g) a polynucleotide encoding a maUire protein encoded by the cDNA 
insert of clone eq229_3 deposited under accession number ATCC 98371; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:10; 

5 (i) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NOilO having biological activity, the fragment 
comprising the amino acid sequence from amino acid 81 to amino acid 90 of SEQ 
IDNO:10; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
10 (a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 
15 Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 

NO:9 from nucleotide 168 to nucleotide' 683; the nucleotide sequence of SEQ ID NO:9 from 
nucleotide 318 to nucleotide 683; the nucleotide sequence of the fuU-length protein coding 
sequence of clone eq229_3 deposited under accession number ATCC 98371; or the 
nucleotide sequence of a mature protein coding sequence of clone eq229_3 deposited 

2 0 under accession number ATCC 98371. In other preferred embodiments, the 

polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone eq229_3 deposited under accession number ATCC 9837L In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:10 from amino acid 53 to amino acid 
25 172. _ _ _ 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:9 or SEQ ID NO:ll. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 

3 0 consisting of: 



(a) the aminTD acid sequence of SEQTD NO:10; 



(b) the amino acid sequence of SEQ ID NO:10 from amino acid 53 to 
amino acid 172; 
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(c) fragments of the amino acid sequence of SEQ ID NO: 10 comprising 
the amino acid sequence from amino acid 81 to amino acid 90 of SEQ ID NO:10; 
and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
5 eq229_3 deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:10 or the amino acid sequence 
of SEQ ID NO:10 from amino acid 53 to amino acid 172. 

In one embodiment, the present invention provides a composition comprising an 
1 0 isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:12; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:12 from nucleotide 67 to nucleotide 879; 

^5 . (c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:12 from nucleotide 118 to nucleotide 879; 

(dj :a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:12 from nucleotide 1224 to nucleotide 2171; 

(e) a polynucleotide comprising the nucleotide sequence of the fuli- 
2 0 length protein coding sequence of clone fh3_6 deposited under accession number 

ATCC 98371; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone fh3_6 deposited under accession number ATCC 98371; 

(g) ' ^ polynucleotide comprising the nucleotide sequence of a mature 
2 5 - protein coding sequence of.clorie fh3.6 deposited under accession number ATCC 

98371; 

(h) a polynucleotide encoding a mature protein encoded by the cDN A 
insert of clone fh3_6 deposited under accession number ATCC 98371; 

(i) a polynucleotide encoding a protein comprising the amino acid 

_3 _g sequ^ce of SEQ ID NO:13; 

(j) a polynucleotide encoding a protein cbmprising^'fragment of the 
amino acid sequence of SEQ ID NO:13 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 130 to amino acid 139 of 
SEQIDNO:13; 



10 
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(k) a poiynucleotide which is an allelic variant of a poljTiucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

5 (m) a polynucleotide capable of hybridizing under stringent conditioT\s 

to any one of the polynucleotides specified in (a)-(j).. 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:12 from nucleotide 67 to nucleotide 879; the nucleotide sequence of SEQ ID NO:12 
from nucleotide 118 to nucleotide 879; the nucleotide sequence of SEQ ID NO:12 from 

10 nucleotide 1224 to nucleotide 2171; the nucleotide sequence of the full-length protein 
coding sequence of clone fh3_6 deposited vmder accession number ATCC 98371; or the 
nucleotide sequence of a mature protein coding sequence of clone fh3_6 deposited under 
accession number ATCC 98371. In other preferred embodiments, the polynucleotide 
encodes the full-length or a mature protein encoded by the cDNA insert of clone fh3_6 

15 deposited under accession number ATCC 98371. In yet other preferred embodiments, 
the present invention provides a polynucleotide encoding a protein comprising the amino 
acid sequence of SEQ ID NO:13 from aihino acid 1 to amino acid 119. , 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNO:12. 

20 In other embodiments/ the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:13; 

(b) 'the amino acid sequence of SEQ ID NO:13 from amino acid 1 to 
.2 5 amino acidj 19; 

(c) fragments of the amino add sequence of SEQ ID. NO: 1 3 comprising 
the amino acid sequence from amino acid 130 to amino acid 139 of SEQ ID NO:13; 
and 

(d) the amino acid sequence encoded by the cDN A insert of clone fh3_6 
3 0 deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino add sequence of SEQ ID NO:13 oi: the amino acid sequence 
of SEQ ID NO:13 from amino add 1 to amino acid 119. 
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In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:14; 

5 (b) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:14 from nucleotide 2 to nucleotide 556; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:14 from nucleotide 53 to nucleotide 556; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID - 
10 NO:14 from nucleotide 1 to nucleotide 367; 

(e) a polynucleotide comprising the nucleotide sequence, of the full- 
length protein coding sequence of clone fs87_3 deposited under accession number 
ATCC 98371; 

(f) a polynucleotide encoding the full-length protein encoded by the 
1 5 cDNA insert of clone fs87_3 deposited under accession number ATCC 98371; 

(g) ^ polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone fs87_3 deposited under accession number ATCC 
98371; . ' 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
2 0 insert of clone fs87_3 deposited under accession number ATCC 98371; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:15; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
aniino acid sequence of SEQ ID NO:15 having biological activity, the fragment 

2 5 comprising the amino acid sequence from amino acid 87 to amino acid 96 of SEQ 

. IDNO:15; ------ 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 

3 0 - of (i) or (j) above ; and 

(m) a^olynucleotide capable of hybridizing under stringent-conditions - 
to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:14 from nucleotide 2 to nucleotide 556; the nucleotide sequence of SEQ ID NO: 14 from 
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nucleotide 53 to nucleotide 556; the nucleotide sequence of SEQ ID NO:14 from 
nucleotide 1 to nucleotide 367; the nucleotide sequence of the full-length protein coding 
sequence of clone fsS7_3 deposited under accession nunr\ber ATCC 98371; or the 
nucleotide sequence of a mature protein coding sequence of clone fs87_3 deposited under 
5 accession number ATCC 98371. In other preferred embodiments, the polynucleotide 
encodes the full-length or a mature protein encoded by the cDNA insert of clone fs87_3 
deposited under accession number ATCC 98371. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNO:14. - " 

10 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:15; 

(b) fragments of the amino acid sequence of SEQ ID NO:l5 comprising 
15 the amino acid^sequence from amino acid 87 to amino acid 96 of SEQ ID NO:15; 

and 

(c) the amino acid sequence encoded by the cDNA insert of clone' 
fs87_3 deposited under accession niunber ATCC 98371; 

the protein being substantially free from other mammalian proteins. Preferably such 
2 0 protein comprises the amino acid sequence of SEQ ID NO:15. 

In one embodiment, the present invention provides a composition comprising ah 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17; " 

2 5 (b) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:17lrom nudeotide'492 tonucleot^^ --- -- - 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone fy530__2 deposited under accession 
number ATCC 98371; 

3 0 (d) a polynucleotide encoding the full-length protein encoded by the 
cDN A^insert of clone fy530-2-deposited-under-aGGession-number ATCC-9837-1; 

(e) a polynucleotide comprising the nucleotide sequence of a mature 
. , protein coding sequence of clone fy530_2 deposited under accession number 
ATCC 98371; . 
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(f) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone fy530_2 deposited under accession number ATCC 98371; 

(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:lS; 

5 (h) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:18 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 13 to amino acid 22 of SEQ 
IDNp:18; - 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 
10 (aHf) above; 

(j) a polynucleotide which encodes a species homologue of the protein 
of (g) or (h) above ; and 

(k) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(h). 
15 Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 

NO:17 from nucleotide 492 to nucleotide 602; the nucleotide sequence of the full-length 
protein coding sequence of clone fy530_2 deposited under accession number ATCC 98371; 
or the nucleotide sequence of a mature protein coding sequence of clone fy530_2 
deposited under accession number ATCC 98371. In other preferred embodiments, the 
2 0 polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone fy530_2 deposited under accession number ATCC 98371. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:17/SEQ ID NO:16 or SEQ ID NO:19 . 

In other embodiments, the present invention provides a composition cornprising 

2 5 a protein, wherein said protein comprises an amino acid sequence selected from the group 

consisting of: 

(a) the amino acid sequence of SEQ ID NO: 18; 

(b) fragments of the amino acid sequence of SEQ ID NO: 18 comprising 
the amino acid sequence from amino acid 13 to amino acid 22 of SEQ ID NO: 18; 

3 0 and 

fy530„2 deposited under accession number ATCC 98371; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:18. 



14 



wo 98/42741 



PCT/US98/05972 



In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:20; 

5 (b) . a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:20 from nucleotide 154 to nucleotide 972; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:20 from nucleotide 1 to nucleotide 341; 

(d) a polynucleotide comprising the nucleotide sequence of the fuU- 
1 0 length protein coding sequence of clone ge51_l deposited under accession number 

ATCC 98371; . 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ge51_l deposited under accession number ATCC 98371; 

(f) a polynucleotide comprising the nucleotide sequence of a mature" 
1 5 protein coding sequence of clone ge51_l deposited under accession number ATCC 

98371; , . ^ , 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone ge51_l deposited under accession number ATCC 98371; 

(h) a polynucleotide encoding a protein comprising the amino acid 
2 0 sequence of SEQ ID NO:21; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:21 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 131 to amino acid 140 of 
SEQIDNO:21; 

2 5 (j) a polynucleotide which is ah allelic variant of a polynucleotide of 

(aHg)'above; ^" ; " " — ~ 

(k) a polynucleotide which encodes a species homologue of the protein 

of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 

3 0 / to any one of the polynucleotides specified in (a)-(i). 

^Preferably,-suGh~polynuGleotide~comprises-the^nucleotide sequence of SEQ ID 

NO:20 from nucleotide 154 to nucleotide 972; the nucleotide sequence of SEQ ID NO:20 
from nucleotide 1 to nucleotide 341; the nucleotide sequence of the full-length protein 
coding sequence of clone ge51_l deposited under accession number ATCC 98371; or the - 
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nucleotide sequence of a mature protein coding sequence of clone ge51_l deposited under 
accession number ATCC 98371. In other preferred embodimenfs, the polynucleotide 
•encodes the full-length or a mature protein encoded by the cDNA insert of clone ge51_l 
deposited under accession number ATCC 98371. In yet other preferred embodiments, 
5 the present invention provides a polynucleotide encoding a protein comprising the amino 
acid sequence of SEQ ID NO:21 from amino acid 1 to amino acid 62. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNO:20. . ^ 

In other embodiments, the present invention provides a composition comprising 
10 a protein, wherein said protein comprises an amino acid sequence selected from' the group 
consisting of: , 

(a) the amino acid sequence of SEQ ID NO:21; 

(b) the amino acid sequence of SEQ ID NO:21 from amino acid 1 to 
amino acid 62; , 

15 (c) fragments of the amino acid sequence of SEQ ID NO:21 comprising 

the amino acid sequence from amino acid 131 to amino acid 140 of SEQ ID NO:21; 

and . . • . . 

(d) the amino acid sequence encoded by the cDNA insert of clone 
ge51_l deposited under accession number ATCC 98371; 

2 0 the protein being substantially firee from other mammalian proteins. Preferably such 

protein comprises the amino acid sequence of SEQ ID NO:21 or the arnino acid sequence 
of SEQ ID NO:21 from amino acid 1 to amino acid 62. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
25 (a) a polynucleotide comprising the nucleotide. sequence of SEQ ID 

NO:22; ' ' ~~ ~ 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
■ . NO:22 from nucleotide 104 to nucleotide 892; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

3 0 Np:22 from nucleotide 299 to nucleotide 892; 

arp'olynucleotide comprising the nucleotide sequence-of-SEQ ID- 
NO:22 from nucleotide 798 to nucleotide 1261; 
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(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone gxlS3_l deposited under accession 
number ATCC 98371; 

(f) a polynucleotide encoding the full-length protein encoded by the 
5 cDNA insert of clone gxl83_l deposited under accession number ATCC 98371; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone gxl83_l deposited under accession number 
ATCC 98371; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
1 0 insert of clone gxl83_l deposited under accession number ATCC 98371; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:23; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:23 having biological activity, the fragment 
15 comprising the amino acid sequence from amino acid 126 to amino acid 135 of 

SEQIDNO:23; . . 

. (k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; ' 

(1) a polynucleotide which encodes a species homologue of the protein 
2 0 of (i) or (j), above ; and ^ 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:22 from nucleotide 104 to nucleotide 892; the nucleotide sequence of SEQ ID NO:22 

2 5 from nucleotide 299 to nucleotide 892; the nucleotide sequence of SEQ ID NO:22 from 

nucleotide 798 to nucleotide 1261 ;ythe nucleotide sequence of the full-length "protein 
coding sequence of done gxl83_l deposited under accession number ATCC 98371; or the 
nucleotide sequence of a mature protein coding sequence of clone gxl83_l deposited 
imder accession number ATCC 98371. In other preferred embodiments, the 

3 0 polynucleotide encodes the full-length or a mature protein encoded by the cDNA insert 
^of"clone-gxi83zl-deposited-under-acGession-number-ATCC-9837a.-Myet.other^^^ 

embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:23 from amino add 53 to arnino acid 
89. 
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Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:22. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
5 consisting of: . 

(a) the amino acid sequence of SEQ ID NO:2i3; 

(b) the amino acid sequence of SEQ ID NO:23 from amino acid 53 to 
amino acid 89; 

(c) fragments of the amino acid sequence of SEQ ID NO:23 comprising 
1 0 the amino acid sequence from amino acid 126 to amino acid 135 of SEQ ID NO:23; 

and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
gxl83_l deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. Preferably such 
15 protein comprises the amino acid sequence of SEQ ID NO:23 or the amino acid sequence 

of SEQ ID NO:23 from amino acid 53 to amino acid 89. 

In certain preferred emibbdiments, the polyrvucleotide is operably linked to an 

expression control sequence. The invention also provides a host cell, including bacterial, 

yeast, insect and mammalian cells, transformed with such polynucleotide compositions. 
2 0 Also provided by the present invention are organisms that have enhanced, reduced, or 

modified expression of the gene(s) corresponding to the polynucleotide sequences 

disclosed herein. 

Processes are also provided for producing a protein, which comprise: 

(a) :growing a culture of the host cell transformed with such 

2 5 polynucleoti*de compositions in a suitable culture medium; and 

(b) purifying the protein from the culture. 

The protein produced according to such methods is also provided by the present 
invention. Preferred embodiments include those in which the protein produced by such 
process is a mature form of the protein. 

3 0 Protein compositions of the present invention may further comprise, a 
pharmaceutically^acceptable-carrier:^ Compositions~comprising~an"~antibody"which™ 
specifically reacts with such protein are also provided by the present invention. 

Methods are also provided for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
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effective amount of a composition comprising a protein of the present invention and a 
pharmaceutically acceptable carrier. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figures 1 A and IB are schematic representations of the pED6 and pNOTs vectors, 

respectively, used for deposit of clones disclosed herein. 

DETAILED DESCRIPTION 
ISOLATED PROTEINS AND POLYNUCLEOTIDES 

10 Nucleotide and amino acid sequences, as presently determined, are reported 

below for each clone and protein disclosed in the present application. The nucleotide 
sequence of each clone can readily be determined by sequencing of the deposited clone 
in accordance with known methods. The predicted amino acid sequence (both full-length 
and mature forms) can then be determined from such nucleotide sequence. The amino 

15 acid sequence of the protein encoded by a particular clone can also be. determined by 
expression of the clone in a suitable host cell, collecting the protein and determining its 
sequence. For each disclosed protein applicants- hav^ identified what they have 
determined to be the reading frame best identifiable with sequence information available 
at the time of filing. ^ 

2 0 As used herein a "secreted" protein is one which, when expressed in a suitable host 

cell, is transported across or through a membrane, including transport as a result of signal 
sequences in its amino acid sequence. "Secreted" proteins include without limitation 
proteins secreted wholly (e.g., soluble proteins) or partially (e.g. , receptors) from the cell 
in which they are expressed. "Secreted" proteins also include without liinitation proteins 

2 5 which are transported across the membrane of the endoplasmic reticulum. 

Clone "bfl71 &' 

A polynucleotide of the present invention has been identified as clone "bf 171_6". , 
bfl71_6 was isolated from a human fetal brain cDNA library using methods which are 

3 0 selective for cDNAs encoding secreted proteins (see U.S. Pat. No; 5^36,637), or was 
identified-as encoding-a-secreted-.-Gr~transmembrane-protein-on-the-basis-of-Comp.uter 

analysis of the amino acid sequence of the encoded protein, bf 171_6 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
'T?fl71.6 protein"). 
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The nucleotide sequence of bfl71_6 as presently determined is reported in SEQ ID 
NO:l. What applicants presently believe to be the proper reading frame and the predicted 
amino acid sequence of the bfl71_6 protein corresponding to the foregoing nucleotide 
sequence is reported in SEQ ID NO:2. 
5 The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

bfl71_6 should be approximately 1900 bp. 

The nucleotide sequence disclosed herein for bfl71_6 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
PASTA search protocols. bfl71_6 demonstrated at least some similarity with sequences 

10 identified as AA147377 (zo39b08.rl Stratagene endothelial cell 937223 Homo sapiens 
cDNA clone 589239 5*), AA190936 (zp83e01.rl Stratagene HeLa cell s3 937216 Homo 
sapiens cDNA clone 626808 5'), AA287427 (zs52b05.rl NCI_CGAP_GCB1 Homo sapiens 
cDNA clone), H77893 (ys09f08.rl Homo sapiens cDNA), N72642 (yv74al2.rl Homo 
sapiens cDNA clone), T25271 (Human gene signature HUMGS07433), T35346 (EST83197 

15 Homo sapiens cDNA 5' end similar to None), and W27589 (34hl Human retina cDNA 
randomly primed sublibrary Homo). Based upon sequence similarity, bfl71„6 proteins 
and each similar protein or peptide iriay share at least some activity. 

Clone "ckl81 7" 

2 0 A polynucleotide of the present invention has been identified as clone "ckl81_7". 

ckl81„7 was isolated from a human adult testes cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino.acid sequence of the encoded protein. ckl81_7 is a full-length clone, 

2 5 includirlg the entire coding sequence of a secreted protein (also referred to herein as 

"ckl81_7 protein"). '~~ ~ ' 

The nucleotide sequence of ckl81_7 as presently determined is reported in SEQ 
ID NO:3. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the ckl81_7 protein corresponding to the foregoing 

3 0 nucleotide sequence is reported in SEQ ID NO:4. 

. XHe EcoRiyNotTrestrictioh'fragment obtainable from the deposit c 
ckl81_7 should be approximately 1475 bp. 

The nucleotide sequence disclosed herein for ckl81_7 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
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FASTA search protocols. ckl81_7 demonstrated at least some similarity with sequences 
identified as AA150370 {2l07e08.rl Soares pregnant uterus NbHPU Homo sapiens cDNA 
clone 491654 5'), H00151 {yl69h05.rl Homo sapiens cDNA clone 43510 5'), N21123 
(yx52f04.sl Homo sapiens cDNA clone 265375 3*). N31138 (yx52f04.rl Homo sapiens 
5 cDNA clone 265375 5'), R13827 (yf61h04.rl Homo sapiens cDNA done 26896 5' similar to 
SP:S42069 S42069 TEGT PROTEIN), and T19278 (Human gene signature HUMGS00295). 
The predicted amino acid sequence disclosed herein for ckl81_7 was searched against the 
GenPept and GeneSeq amino acid sequence databases using the BLASTX search protocol. 
The predicted ckl81_7 protein demonstrated at least some similarity to sequences 
10 identified as US8168 (weak similarity to rat TEGT protein (GI 456207) [Caenorhabditis 
elegans]). Based upon sequence similarity, ckl81_7 proteins and each similar protein or 
peptide may share at least some activity. The TopPredll computer program predicts 
seven potential transmembrane domains within the ckl81_7 protein sequence, centered 
around amino acids 93, 136, 168, 206, 229, 258, and 283 of SEQ ID NO:4, respectively. 

15 ^ . • " ■ 

Clone "co736 3" 

A polynucleotide of the present invention has-been identified as clone "co736_3". 
co736_3 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 

2 0 identified as encoding a secreted or transmembrane protein on the basis of computer 

analysis of the amino acid sequence of the encoded protein. co736_3 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
^ "co736_3i protein •). 

The nucleotide sequence of co736_3 as presently determined is reported in SEQ 
25 ' ID NO:5. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the co736_3 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:6. Amino acids 44 to 56 are a predicted 
leader /signal sequence, with the predicted mature amino acid sequence begirming at 
amino acid 57, or are a transmembrane domain. 

3 0 The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

co736_3 should b~e approximately^! 980 Bp: 

The nucleotide sequence disclosed herein for co736_3 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. co736_3 demonstrated at least some similarity with sequences 



21 



wo 98/42741 



PCT/US98/05972 



identified as H02676 (yj36g08.rl Homo sapiens cDNA), H47499 (yp74cl0.rl Homo sapiens 
cDNA clone 193170 5'), Q53478 (MLL gene 8.3 kb BamHI genomic region), T91862 
(yd54b07.sl Homo sapiens cDNA clone 112021 3''similar to SP:LIN1_NYCC0 P08548 
LINE-1 REVERSE TRANSCRIPTASE ;contains Alu repetitive element;contains LI 
5 repetitive element), U54776 (Human NTT gene, LI, Alu, and MER 38 repeat regions), 
Z73964 (Human DNA sequence from cosmid V698D2, between markers), and Z83843 
(Human DNA sequence from PAC 368A4 on chromosome X, Contains ESTs, CELLULAR 
NUCLEIC ACID BINDING PROTEIN (CNBP) like gene and STSs). Based upon sequence 
similarity, co736_3 proteins and each similar protein or peptide may share at least some 
10 activity. The TopPredll computer program predicts two potential transmembrane 
domains within, the co735_3 protein sequence, one centered around amino acid 16 and 
another around amino acid 51 of SEQ ID NO:6. The nucleotide sequence of co736_3 
indicates that it may contain one or more copies of the Alu repetitive element. 

15 Clone "dm26 T . . ' 

A polynucleotide of the present invention has been identified as clone "dm26_2'*. 
dm26_2 was isolated from. a hurrian -adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5^36,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 

2 0 analysis of the amino acid sequence of the encoded protein, dm26_2 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"dm26_2 protein"). 

The nucleotide sequence of dm26_2 as presently determined is reported in SEQ 
ID NO:7. What applicants presently believe to be the proper . reading frame and the 

2 5 predicted amino acid sequence of the dm26_2 protein corresponding to the foregoing 

nucleotide sequence is reported in SEQ ID NO:8. Amino acids 9 to 21 of SEQ ID NO:8 are 
a possible leader/ signal sequence, with the predicted mature amino acid sequence 
beginning at amino acid 22, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing done 

3 0 dm26_2 should be approximately 3500 bp. , 

Themicleo tide Tequenceliiscr6sed~li^fein' searched~agaifLSt"the" 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
PASTA search protocols. din26_2 demonstrated at least some similarity with sequences 
identified as AC000356 (Human cosmid gl346a312, complete sequence), F03454 (H. 
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sapiens partial cDNA sequence; clone c-lxhlO), N42290 (yy06a07.rl Homo sapiens cDNA 
clone 270420 5* similar to contains Ll.t3 LI repetitive element), N92463 (zbl2e05,sl Homo 
sapiens cDNA clone 301856 3'), N94118 (za25e06.rl Homo sapiens cDNA clone 293602 5^, - 
Q60160 (Human brain Expressed Sequence Tag EST02148), ZS3745 (Human DNA 
5 sequence from PAC 453A3 contains EST and STS), and Z99129 (Human DNA sequence 
SEQUENCING IN PROGRESS from clone 425C14; HTGS phase 1. 1). The predicted . 
amino acid sequence disclosed herein for dm26_2 was searched against the GenPept and 
GeneSeq amino acid sequence databases using the BLASTX search protocol. The 
predicted dm26_2 protein demonstrated at least some similarity to sequences identified 

10 as M22333 (unknown protein [Homo sapiens]), X61294 (LI retroposon, a portion of its 
ORF2 sequence' [Rattus norvegicus]), and Z81053 (E02A10.1 [Caenorhabditis elegans]). 
Based upon sequence-similarity, dm26_2 proteins and each similar protein or.peptide may 
share at least some activity. The nucleotide sequence of dm26„2 indicates that it may 
contain one or more of the following repetitive elements: Alu/Ll. 

15 . 

Clone "eq229 3" - 

A polynucleotide of the present "invention has been identified as clone "eq229_3". 

eq229_3 was isolated from a human adult testes cDNA library using methods which are . 

selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
20 identified as encoding a secreted or transmembrane protein on the basis of computer 

analysis of the amino acid sequence of the encoded protein. eq229_3 is a full-length clone, 

including the entire coding sequence of a secreted protein (also referred to herein as 

"eq229_3 protein"). 

The nucleotide sequence of the 5' portion of eq229_3 as presently determined is 

2 5 reported in SEQ ID NO:9. What applicants presently believe is the proper reading frame 

for the coding region is indicated in SEQ ID NO:10. The predicted amino acid sequence 
of the eq229_3 proteiri corresponding to the foregoing nucleotide sequence is reported in 
SEQ ID NO:10. Amino acids 38 to 50 are a predicted leader/ signal sequence, with the 
predicted mature _ amino acid sequence begirming at zimino acid 51, or are a 

3 0 transmembrane domain. Additional nucleotide sequence from the 3' portion of eq229-_3, 

Lncluciing~the~polyA' tail7is reported^ 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
eq229_3 should be approximately 1900 bp. 
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The nucleotide sequence disclosed herein for eq229_3 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. eq229_3 demonstrated at least some similarity with sequences 
identified as N52034 (y208g04.sl Homo sapiens cDNA clone 282486 3') and W01791 
5 (2a72d06.rl Soares fetal lung NbHL19W Homo sapiens cDNA clone 298091 5*). Based 
upon sequence similarity, eq229_3 proteins and each similar protein or peptide may share 
at least some activity. 

Clone "fh3 6" 

10 A polynucleotide of the present invention has been identified as clone "fh3_6". 

fh3_6 was isolated from a human fetal brain cDNA libraiy using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as. encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. fh3_6 is a full-length clone, 

15 including the entire coding sequence of a secreted protein (also referred to herein as 
••fh3„6 protein"). 

The nucleotide sequence of fh3_6 as presently 'determined is reported in SEQ ID 
NO:12. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the fh3_6 protein corresponding to the foregoing 
2 0 nucleotide sequence is reported in SEQ ID NO:13. Amino acids 5 to 17 of SEQ ID NO:13 
are a predicted leader/signal sequence, with the predicted mature amino acid sequence 
beginning at amino acid 18. Another potential fh3_6 reading frame and predicted amino 
acid sequence is encoded by basepairs 765 to 1556 of SEQ ID NO: 12 and is reported in 
SEQ ID NO:34. The overlapping open reading frames that encode SEQ ID NO:13 and 

2 5 SEQ ID NO:34 could be joined into a single oepn reading frame if a frameshift was 

introduced into the nucleotide sequence of SEQ ID NO:12 between base pairs 765 and 882. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
fh3_6 should be approximately 2300 bp. 

The nucleotide sequence disclosed herein for fh3_6 was searched against the 

3 0 GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA" search-protocols— fh3_6- demonstrated -at least-some" similarity with'sequences - 

identified as AA103102 (mol7f02.rl Life Tech mouse embryo 13 5dpc 10666014 Mus . 
musculus cDNA clone 553851 5'), W72947 (2d62gll.sl Soares fetal heart NbHH19W 
Homo sapier\s cDNA clone 345284 3'), W74413 (2d62gll.rl Soares fetal heart NbHH19W 
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Homo sapiens cDNA clone 345284 5'), and W88819 (2h71dll.rl Soares fetal liver spleen 
INFLS SI Homo sapiens cDNA clone 417525 5'). The predicted amino acid sequence 
disclosed herein for fh3_6 was searched against the GenPept and GeneSeq amino acid 
sequence databases using the BLASTX search protocol. The predicted fh3_6 protein 
5 demonstrated at least some similarity to sequences identified as Z81052) D2023.6 
[Caenorhabditis elegans]). Based upon sequence similarity, fh3_6 proteins and each 
similar protein or peptide may share at least some activity. The Motifs computer progras 
predicts a prenyl group binding site (C AAX box) at amino acid 268 of SEQ ID NO:13. 

10 Clone "fs87 3" 

A polynucleotide of the present invention has been identified as clone "fs87_3": 
fs87_3 was isolated from a human adult testes cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 

1 5 analysis of the amino acid sequence of the encoded protein. fs87_3 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 

"fs87_3 protein"). " . ' . ^ - 

The nucleotide sequence of fs87__3 as presently determined is reported in SEQ ID 
NO:l4. What applicants presently believe to be the proper reading frame and the 
2 0 predicted amino acid sequence of the fs87_3 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:15. Amino acids 5 to 17 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 18, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

2 5 fs87_3 should be approximately 1300 bp. 

The nucleotide sequence disclosed herein for fs87_3 was searched against the 
GenBar\k and GeneSeq nucleotide sequence databases using BLASTN /BLASTX and 
FAST A search protocols: fs87_3 demonstrated at least some similarity with sequences 
identified as AA223699 (2rlOc04.sl Stratagene .NT2 neuronal precursor 937230 Homo 

3 0 sapiens cDNA clone 651078 3') and AA287263 (2s49h08.rl NCI_CGAP_GCB1 Homo 
sapiens-cDNA -clone- IMAGE:70G863 5-similar -to-SW:GG91-YEAST-P41-7-33 CELL 

DIVISION CONTROL PROTEIN 91). The predicted amino add sequence disclosed herein 
for fs87_3 was searched against the Ger\Pept and GeneSeq aniino add sequence, databases 
using the BLASTX search protocol. The predicted fs87_3 protein demonstrated at least 
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some similarity to sequences identified as L31649 (cdc91 [Saccharomyces cerevisiae]),. 
S72417 (E2 {patient 3) [hepatitis C virus]), U06711 (tracheobronchial mucin [Homo 
sapiens]), Z75550 (T22C1.3 [Caenorhabditis elegans]), and Z98598 (hypothetical protein 
[Schizosaccharomyces pombe]). Based upon sequence similarity, fs87_3 proteins and each 
5 similar protein or peptide may share at least some activity. The TopPredll computer 
program predicts two additional potential transmembrane domains within the fs87_3 
protein sequence, one centered around amino acid 90 and another around amino acid 170 
. of SEQIDNO:15. 

10 . Clone "fv530 2" 

A polynucleotide of the present invention has been identified as clone. "fy530„2". 
fy530_2 was isolated from a human adult placenta cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 

1 5 analysis of the amino acid sequence of the encoded protein. fy530_2 is a fuU-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"fy530_2 protein"). _ '.' ■ . . " .. . ... 

The nucleotide sequence of the 5' portion of fy530_2 as presently determined is 
reported in SEQ ID NO:16. An additional internal nucleotide sequence from fy530_2 as 

2 0 presently determined is reported in SEQ ID NO:17. What applicants believe is the proper 
reading frame and the predicted amino acid sequence encoded by such internal sequence 
is reported in SEQ ID NO: 18. Additional nucleotide sequence from the 3' portion of 
fy530_2, including the polyA tail, is reported in SEQ ID NO:19. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

2 5 fy530_2 should be approximately 3550 bp. 

The nucleotide sequence disclosed herein for fy530_2 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. fy530_2 demonstrated at least some similarity with sequences 
identified as AA029852 (zkllb04.sl Soares pregnant uterus NbHPU Homo sapiens cDNA 

3 0 clone 470191 3'), AA118938 (mp64g01.rl Soares 2NbMT Mus musculus cDNA clone 
— ^ — 574032 5'),'t39210(Human*inosine"monophospha^^ complete 

cds), N51229 (yzl3b07.sl Homo sapiens cDNA clone 282901 3'), and X95808 (H.sapiens 
mRNA for protein encoded by a candidate gene, DXS6673E, for mental retardation)*. The 
predicted amino acid sequence disclosed herein for fy530_2 was searched against the 
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GenPept and GeneSeq amino acid sequence databases using the BLASTX search protocol. 
The predicted fy530_2 protein demonstrated at least . some similarity to sequences 
identified as X95S08 (X-linked mental retardation candidate gene [Homo sapiens]). Based 
upon sequence similarity, fy530_2 proteins and each similar protein or peptide may share 
5 at least some activity. 

Clone "ee51 1" 

A polynucleotide of the" present invention has been identified as clone "ge51_l". 
ge51_l was isolated from a human adult brain cDNA library using methods which are 

10 selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. ge51_l is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"ge51_l protein"). .... 

15 The nucleotide sequence of ge51_l as presently determined is reported in SEQ ID 

NO:20. What applicants presently believe to -be the proper reading frame and the 
predicted amino acid sequence of the ge51_l protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:21. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

2 0 ge51_l should be approximately 1850 bp. 

The nucleotide sequence disclosed herein for ge51_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN /BLASTX and 
PASTA search protocols. ge51_l demonstrated at least some similarity with sequences 
identified as AA219716 (zq98d02.rl Stratagene NT2 neuronal precursor 937230 Homo 

2 5 sapiens cDNA clone .650019 5'), AA434286 (2w30f01.rl Soares ovary tumor NbHOT Homo 
sapiens cDNA- clone -770809 -5' -similar to SW:NALS_BOVIN P08037 

N-ACETYLLACTOSAMINE SYNTHASE), D61576 (Human fetal brain cDNA 5 -end 
GEN-419H03), H30715 (yo78h01.rl Homo sapiens cDNA clone. 184081 5'), T80315 
. (ydOTbOB.rl Homo sapiens cDNA clone 24966 5'), U19889 (Callus gaUus 

3 0 beta-l,4-galactosyltransf erase (CKII) xnRNA, complete cds), and W90417 (2h72h01.sl 

Soares fetal liver s pleen IN FLS SI Homo sapie ns cDNA do ne 41764 9 3'). The, predicted 
amino acid sequence disclosed herein for ge51_l was searched against the GenPept and 
GeneSeq amino acid sequence databases using the BLASTX search protocol. The 
predicted ge51_l protein demonstrated at least some similarity to sequences identified as 
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M70433 (beta-l,4-galactosyI transferase [Homo sapiens]), R05932 (Human beta-1,4- 
galactosyltransferase), and beta-l,4-galactosylLransferases from several other species. 
Based upon sequence similarity, ge51_l proteins and each similar protein or peptide may 
share at least some activity. The TopPredll computer program predicts two potential 
5 transmembrane domains w^ithin the ge51_l protein sequence, one centered around amino, 
acid X20 and another around amino acid 90 of SEQ ID NO:21. 

Clone "exl83 1" 

A polynucleotide of the present invention has been identified as clone "gxl83_l". ' 
10 gxl83_l was isolated from a human adult braincDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. gxl83_l is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
15 "gxl83_l protein"). 

The nucleotide sequence of gxl83_l as presently determined is reported in SEQ 
ID NO:22. What applicants presently believe to be" the proper reading frame and the 
predicted amino acid sequence of the gxl83_l protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:23. Amino acids 53 to 65 are a predicted 
2 0 leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 66, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
gxlS3_l should be approximately 2000 bp. 

The nucleotide sequence disclosed herein for gxl83_l was searched against the 

2 5 GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 

. FASTA search protocols. gxl83_l demonstrated at reast"s6me similarity with^equences 
identified as AA010474 (zi09a06.rl Scares fetal liver spleen INFLS SI Homo sapiens 
cDNA clone 430258 5'), H01847 (yj28f09.rl Homo sapiens cDNA clone 150089 5'), L38971 
(Mus musculus (E25) mRNA, complete cds), Q60909 (Human brain Expressed Sequence 

3 0 Tag EST00998), W37875 zclScOl.sl Scares parathyroid tumor NbHPA Homo sapiens 
jcDNA clone 3221 76 .3:),_and_W.72197. (2d69ell.sl_Soares-fetal -hear-t-NbHHl-9W-HGmp- 

sapiens cDNA clone 345932 3'). The predicted amino acid sequence disclosed herein for 
gxl83_l was searched against the GenPept and GeneSeq amino acid sequence databases 
using the BLASTX search protocol. The predicted gxl83_l protein demonstrated at least 
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some similarity to sequences identified as AL021786-(dJ695H22.1 (mouse E25 like protein) 
[Homo sapiens]) and L38971 (putative [Mus musculus]). Based upon sequence similarity, 
gxl83_l proteins and each similar protein or peptide may share at least some activity. 

5 Deposit of Clones 

Clones bfl71_6, ckl81_7, co736_3, dm26„2, eq229_3, fh3_6, fs87_3/ fy530_2, 
ge51_l, and gxl83_l were deposited on March 25, 1997 with the American Type Culture 
Collection as an original deposit under the Budapest Treaty and were given the accession 
number ATCC 98371, from which each clone comprising a particular polynucleotide is 
10 obtainable. All restrictions on the availability to the public of the deposited material will 
be irrevocably removed upon the granting of the patent; except for the requirements 
specified in 37 C.F.R. § 1.808(b), and the term of the deposit will comply with 37 C^F.R. 
§ 1.806. 

Each clone has -been transfected into separate bacterial cells (£. coli) in this , 
15 composite deposit. . Each clone can be removed from 'the vector in which it was deposited 
by performing an EcoRI/NotI digestion (5* site, EcoRI; 3' site, NotI) to produce the 
appropriate fragment for. such clone. Each clone was deposited in either the pED6 or 
pNOTs vector depicted in Fig. 1. The pED6dpc2 vector ("pED6") was derived from 
pED6dpcl by insertion of a new polylinker to facilitate cDNA cloning (Kaufman et aL, 
2 0 1991, Nucleic Acids Res, 19: 4485-4490); the pNOTs vector was derived from pMT2 
(Kaufman et aZ., 1989, MoL Cell Biol. 9: 946-958) by deletion of the DHFR sequences, 
insertion of a new polylinker, and insertion of the M13 origin of replication in the Clal site. 
In some instances, the deposited clone can become "flipped" (i.e., in the reverse 
orientation) in the deposited isolate. In such instances, the cDNA insert can still be 

2 5 isolated by digestion with EcoRI and NotI. However, NotI wUl then produce the 5* site 

and EcoRI will produce the 3* site for placement of the cDNA in proper orientation for 
expression in a suitable vector. The cDNA may also be expressed from the vectors in 
which they were deposited. 

Bacterial cells containing a particular clone can be obtained from the composite 

3 0 deposit as follows: 

^ArToligonudeotide probe"^^^^ 
known for that particular clone. This sequence can be derived from the sequences 
provided herein, or from a combination of those sequences. The sequence of the 
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oligonucleotide probe that was used to isolate each full-length clone is identified below, 
and should be most reliable in isolating the clone of interest- 



Clone 


Probe Sequence 


bfl71_6 


SEQIDNO:24 


ckl81_7 


SEQ ID NO:25 


co736_3 


SEQ ID NO:26 


dm26_2 


SEQ ID NO:27 


eq229_3 


SEQ ID NO:28 . 


fh3_6 


SEQIDNO:29 


fs87_3. 


■. SEQIDNO:30 


fy530_2 


SEQ ID NO:31 


ge51_l 


SEQ ID NO:32 


gxl83_l 


SEQ ID NO:33 



15 ■ , 

In the sequences listed above which include an N at position 2, that position is occupied 
in preferred probes /primers by a biatinylated phosphoaramidite residue rather than a 
nucleotide (such as , for exarnple, that produced by use of biotin phosphoramidite (1- 
dimethoxytrityloxy-2-(N-biotinyl-4-aminobutyl)-propyl-3-0-(2-cyanoethyl)-{N,N- 

2 0 diisopropyl)-phosphoramadite) (Glen Research, cat. no. 10-1953)). 

The design of the oligonucleotide probe should preferably follow these 
parameters: 

(a) It should be designed to an area of the sequence which has the fewest 
ambiguous bases ("N's"), if any; 
25. (b) It should be designed to have a T^^ of approx. 80 ° C (assiaming 2° for each 

A or T and 4 degrees for each G or C). . 
The oligonucleotide should preferably be labeled with g-^^P ATP (specific activity 6000 
Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for 
labeling oligonucleotides. Other labeling techniques can also be used. Unincorporated 

3 0 label should preferably be removed by gel filtration chromatography or other established 

ISiethocisr 'The amotmt3fl^ioactivify~incorporated"into'the probeshoiUdbequantitated 
by measurement in a scintillation counter. Preferably, specific activity of the resulting 
probe should be approximately 4e+6 dpm/pmole. 
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The bacterial culture containing the pool of full-length clones should preferably 
be thawed and 100 pi of the stock used to inoculate a sterile culture flask containing 25 ml 
of sterile L-broth containing ampicillin at 100 pg/mL The culture should preferably be 
grown to saturation at 37°C, and the saturated culture should preferably be diluted in 
5 fresh L-broth. Aliquots of these dilutions should preferably be plated to" determine the 
dilution and volume which will yield approximately 5000 distinct and well-separated 
colonies on solid bacteriological media containing L-broth containing ampicillin at 100 
pg/ml and agar at 1.5% in a 150 mm petri dish when grown overnight at 37°C. Other 
known methods of obtaining distinct, well-separated colonies can also be employed. 

10 Standard colony hybridization procedures should then be used to transfer the 

colonies to nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65°C for 1 hour with gentle agitation in 
6X SSC (20X stock is 175.3 g NaCl/liter, 88.2 g Na citrate /liter, adjusted to pH 7.0 with 
NaOH) containing 0.5% SDS, 100 pg/ml of yeast RNA, and 10 mM EDTA (approximately 

15 10 mL per 150 mm filter). Preferably, the probe is then added to the hybridization mix. at 
a concentration greater than or equal to le+6 dpm/mL. The filter is then preferably 
incubated at 65°C with gentle agitation overnight. The filter is then preferably washed in 
500 mL of 2X SSC/0.5% SDS at room temperature without agitation, preferably followed 
by 500 mL of 2X SSC/0.17o SDS at room temperature with gentle shaking for 15 minutes. 

2 0 A third wash with O.IX SSC/0.5% SDS at 65°C for 30 minutes to 1 hour is optional. The 
filter is then preferably dried and subjected to autoradiography for sufficient time to 
visualize the positives on the X-ray film. Other known hybridization methods can also 
be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated 

2 5 using standard procedures. The clones can then be verified by restriction analysis, 

hybridization analysis, or DNA sequencing. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the 
protein may be in linear form or they may be cyclized using known methods, for example, 

3 0 as described in H.U. Saragoyi, et al, Bio /Technology 10, 773-778 (1992) and in R-S. 
-McDowell7e^fli^,JTAmer^ehem.ScK:^Ti47 9245-9253 (1992)r^ 

herein by reference. Such fragments may be fused to carrier molecules such as 
immunoglobulins for many purposes, including increasing the valency of protein binding 
sites. For example, fragments of the protein may be fused through "linker" sequences to 
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the Fc portion of an immunoglobulin. For a bivalent form of the protein, such a fusion 
could be to the Fc portion of an IgG molecule. Other immunoglobulin isotypes may also 
be used to generate such fusions. For example, a protein - IgM fusion would generate a 
decavalent form of the protein of the invention. 
5 The present invention also provides both full-length and mature forms of the 

disclosed proteins. The full-length form of the such proteins is identified in the sequence 
listing by translation of the nucleotide sequence of each disclosed clone. The mature 
form(s) of such protein may be obtained by expression of the disclosed full-length 
polynucleotide (preferably those deposited with ATCC) in a suitable mammalian cell or 

10 other host cell. The sequence(s) of the mature form(s) of the protein may also be 
determinable from the amino acid sequence of the full-length form. 

The present invention also provides genes corresponding to the polynucleotide 
sequences disclosed herein. "Corresponding genes" are the regions of the genome that 
are transcribed to produce the mRNAs from which cDN A polynucleotide sequences are 

15 derived and may include contiguous regions of the genome necessary for the regulated 
expression of such genes. Corresponding genes may therefore include but are not limited 
to coding sequences, 5' and 3' untranslated regions, alternatively spliced exons, introris, 
promoters, enhancers, and silencer or suppressor elements. The corresponding genes can 
be isolated in accordance with known methods using the sequence information disclosed 

2 0 herein. Such methods include the preparation of probes or primers from the disclosed 

sequence information for identification and /or amplification of genes in appropriate 
genomic libraries or other sources of genomic materials. An "isolated gene" is a gene that 
has been separated from the adjacent coding seqiiences, if any, present in the genome of 
the organism from which the gene was isolated. 
25 Organisms that have enhanced, reduced, or modified expression of the gene(s) 

corresponding to the polynucleotide sequences disclosed herein are provided. The 
desired change in gene expression can be achieved through the use of antisense 
polynucleotides or ribozymes that bind and /or cleave the mRNA transcribed from the 
gene (Albert and Morris, 1994, Trends Pharmacol. ScL 15(7): 250-254; Lavarosky et al, 1997, 

3 0 Biochem. MoL Med. 62(1): 11-22; and Hampel, 1998, Prog. Nucleic Add Res. Mol Biol. 58: 1- 

39; all of^hlch are incorporated "by refeFence hYrein)T~Txarisgem 
multiple copies of the gene(s) corresponding to the polynucleotide sequences disclosed 
herein, preferably produced by transformation of cells with genetic constructs that are 
stably maintained within the transformed cells and their progeny, are provided. 
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Transgenic animals that have modified genetic control regions that increase or reduce 
gene expression levels, or that change temporal or spatial patterns of gene expression, are 
also provided (see European Patent No. 0 649 464 Bl, incorporated by reference herein). 
In addition,, organisms are provided in which the gene(s) corresponding to the 
5 polynucleotide sequences disclosed herein have been partially or-completely inactivated, 
through insertion of extraneous sequences into the corresponding gene(s) or through 
deletion of all or part of the corresponding gene(s). Partial or complete gene inactivation 
can be accomplished through insertion, preferably followed by imprecise excision, of 
transposable elements (Plasterk, 1992, Bioessays 14(9): 629-633; Zwaal et al, 1993, Proc. Natl 

1 0 Acad, Sci. USA 90(16): 7431-7435; Clark et al, 1994, Proc. Natl Acad. Scl USA 91(2): 719-722; 
all of which are incorporated by reference herein), or through homologous recombination, 
preferably detected by positive/ negative genetic selection strategies (Mansour et al, 1988, 
Nature 336: 348-352; U.S. Patent Nos. 5,464,764; 5,487,992; 5,627,059; 5,631,153; 5,614, 396; 
5,616,491; and 5,679,523; all of which are incorporated by reference herein). These 

15 organisms with altered gene expression are preferably eukaryotes and more preferably 
are mammals. Such organisms are useful for the development of non-human models for 
the study of disorders involving- the corresponding gene(s), and for the development of 
assay systems for the identification of molecules that interact with the protein product(s) 
of the corresponding gene(s). 

2 0 Where the protein of the present invention is membrane-boimd (e.g., is a receptor), 

the present invention also provides for soluble forms of such protein. In such forms part 
or all of the intracellular and transmembrane domains of the protein are deleted such that 
the -protein is fully secreted from the cell in which it is expressed. The intracellular and 
. transmembrane domains of proteins of the invention can be identified in accordance with 

2 5 known techniques for determination of such domains from sequence information. 

Proteins and protein fragments of the present invention include 'proteins with 
amino acid sequence lengths that are at least 25%(more preferably at least 50%, and most 
preferably at least 75%) of the length of a disclosed protein and have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90% or 95% 

3 p identity) with that disclosed protein, where sequence identity is deteirmined by comparing 

th~e~amino acid"sequences-of-the-proteins-when-aligned-so~as-to maximize overlap-and 
identity while minimizing sequence gaps. Also included in the present invention are 
proteins and protein fragments that contain a segment preferably comprising 8 or more 
(more preferably 20 or more, most preferably 30 or more) contiguous amino acids that 
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shares at least 75% sequence identity (more preferably, at least 85% identity; most 
preferably at least 95% identity) with any such segment of any of the disclosed proteins. 

Species homologues of the disclosed polynucleotides and proteins are also 
provided by the present invention. As used herein, a ''species homologue" is a protein or 
5 polynucleotide with a different species of origin from that of a given protein or 
polynucleotide, but with significant sequence similarity to the given protein or 
polynucleotide. Preferably, polynucleotide species homologues have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90%.identity) with 
the given polynucleotide, and protein species homologues have at least 30% sequence 

1 0 identity (more preferably /at least 45% identity; most preferably at least 60% identity) with 
the given protein, where sequence identity is determined by comparing the nucleotide 
sequences of the polynucleotides or the amino acid sequences of the proteins when 
aligned so as to maximize overlap and identify while minirnizing sequence gaps. Species 
homologues may be isolated and identified by making suitable probes or primers from 

15^ the sequences provided herein and screening a suitable nucleic acid source from the 
desired species. Preferably, species homologues are those isolated from mammalian 
species. Most preferably, species homologues are those isolated from certain mammalian 
species such as, for example. Pan troglodytes. Gorilla gorilla, Pongo px/gmaeus, Hylobates 
concolor, Macaca mulatta, Papio papio, Papio hamadryas, Cercopithecus aethiops, Cebxis capucinus, 

2 0 Aotus trivirgatiis, Sangiiimis oedipus, Microcebus murinus, Miis musculus/Rattiis norvegicus, 
Cricetulus griseiis, Felis catus, Mustela vison, Canis familiaris, Oryctolagus cuniculus, Bos taurus, 
Ovis aries, Siis scrofa, and Equus caballus, for which genetic maps have been created 
allowing the identification of syntenic relationships between the genomic organization of 
genes in one species and the genomic organization of the related genes in another species 

25 (O'Bneii and"Se"uane2^ 

Genetics 3:103-112; Johansson et ah, 1995, Genomics 25: 682-690; Lyons et al, 1997, Nature . 
Genetics 15: 47-56; O'Brien et al„ 1997, Trends in Genetics 13(10): 393-399; Carver and Stubbs, 
1997, Genome Research 7:1123-1137; all of which are incorporated by reference herein). 

The invention also encompasses allelic variants of the disclosed polynucleotides 
„3 0 — or-proteinshthat is,-naturally-OGCiirring~alternative-for^ 

which also encode proteins which are identical or have significantly similar sequences to 
those encoded by the disclosed polynucleotides.. ^Preferably, allelic variants have at least 
60% sequence identity (more preferably, at least 75% identity; most preferably at least 90% 
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identity) with the given polynucleotide, where sequence identity is determined by 
comparing the nucleotide sequences of the polynucleotides when aligned so as to maxirnize 
overlap and identity while minimizing sequence gaps. Allelic variants may be isolated and 
identified by making suitable probes or primers from the sequences provided herein and 
5 screening a suitable nucleic acid source from individuals of the appropriate species. 

The invention also includes polynucleotides with sequences complementary to 
those of the polynucleotides disclosed herein. 

The present invention also includes polynucleotides capable of hybridizing under 
reduced stringency conditions, more preferably stringent conditions, and most preferably 
1 0 highly stringent conditions, to polynucleotides described herein. Examples of stringency 
conditioris are shown in the table below: highly stringent conditions are those that are. at 
least as stringent as, for example, conditions A-F; stringent conditions are at least as 
stringent as, for example, conditions G-L; and reduced stringency conditions are at least 
as stringent as, for example, conditions M-R. 
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Stringency 
Condition 


Polynucleotide 
Hybrid 


Hybrid 
Length 


Hybridization Temperature and 
Buffer* 


Wash 

Temperature 
and Buffer^ 


A 


DNA:DNA 


^50 . 


65"C; IxSSC -or- 


65 °C; 0.3xSSC 


B 


DNAiDNA 


<50 


Tb*; IxSSC - 


Tg*; IxSSC 


C 


DNA:RNA 


i 50 


67°C; IxSSC -or- 

45 C; IxbbC, 50% rormamide 


67°C; 0.3xSSC 


D 


DNA:RNA 


<50 


Td*; IxSSC 


TdMxSSC 


E 


" RNA:RNA 


i 50 


70"C; IxSSC -or- 

DKJ ixoov_, ou/o rormamiae 


70°C; 0.3xSSC. 


F 


RNA:RNA 


<50 


Tp*; IxSSC 


Tp*; IxSSC 


G 


DNA:DNA 


2 50 


65''C;4xSSC-or- 

4^ 4xbbC^ ou /o rormamiae 


65 °C; IxSSC 


H 


DNA:DNA 


<50 


Th*; 4xSSC 


Th*; 4xSSC 


I 


DNA:RNA 


2 50 


67<»C; 4xSSC -or- 

45°C; 4xSSC, 50% formamide 


67°C; IxSSC 


J ■ 


DNAiRNA . 


<50 


Tj*;4xSSC 


TJ^ 4xSSC 


K 


RNA:RNA 


i 50 


70^C; 4xSSC -or- 

oU 4xbbL., oU /o tonmamide 


67° C; IxSSC 


L 


RNA:RNA 


: <50 


TL^2xSSC 


Tl*; 2xSSC 


M 


DNA:DNA 


i 50 


50**C; 4xSSC -or- 

4U L.; oxboL., dUvo tormamide 


50 °C; 2xSSC 


N 


DNA:DNA 


<50 


Tn*; 6xSSC 


T^,*; 6xSSC 


O 


DNA:RNA 


i 50 


55''C; 4xSSC-or- ' 

4z oxbbL-, /o rormarrude 


55 °C; 2xSSC 


P 


DNA:RNA 


<50 


Tp*; 6xSSC 


T/; 6xSSC 


Q 


■RNA;RNA 


2*50 


60*'C; 4xSSC -or- - 

45°C; 6xSSC, 50% formamide 


60°C; 2xSSC 


R 


RNA:RNA 


<50 _ 


Tr*;4xSSC 


Tr*;4xSSC^ 



h The hybrid length is that anticipated for the hybridized region(s) of the hybridizing polynucleotides. When 
hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed 
to be that of the hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, the 

2 5 hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region 
or regions of optimal sequence complementarity. 

SSPE {IxSSPE is 0.15M NaCl, 10nr\M NaH^FO,, and 1.25mM EDTA, pH 7.4) can be substituted for SSC 
(IxSSC is 0.1 5M NaCl and 15mM sodium citrate) in the hybridization and wash buffers; washes are 
performed for 15 minutes after hybridization is complete. 

3-0 - TR- The hybridizabon-temperature-for'hybrids-antidpated-to be less than 50 base pairs in^ 

be S-IO'^C less than the melting temperature (T„)-of the hybrid, where T„ is determined according to the 
following equations.' For hybrids less than 18 base pairs in length, T^(°C) = 2(# of A + T bases) + 4(# of G + 
C bases). For hybrids between 18 and 49 base pairs in length, TJ*»C) = 81.5 + 16.6(logio[Na*]) + 0.41(%G+C) - 
(600/N), where N is the number of bases in the hybrid, and [Na*] is the concentration of sodium ions in the 

35 hybridization buffer ([Na*] for IxSSC = 0.165 M). 
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Additional examples of stringency conditions for polynucleotide hybridization are 
provided in Sambrook, J., E.F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, F.M. Ausubel et al., eds., 
5 John Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 

Preferably, each such hybridizing polynucleotide has a length that is at least 
25%(more preferably at least 50%, and most preferably at least 75%). of the length of the 
polynucleotide of the present invention to which it hybridizes, and has at least 60% 
sequence identity (more preferably, at least 75% identity; most preferably at least 90% or 
10 95% identity) with the polynucleotide of the present invention to which it hybridizes, 
where sequence identity is determined by comparing the sequences of the hybridizing 
pol)mucleotides when aligned so as to maximize overlap and identity while minimizing 
sequence gaps. 

The isolated polynucleotide of the invention may be operably linked to an 
15 expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et al, Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein 
recombinantly. Many suitable expression control sequences are known in the art. General 
methods of expressing recombinant proteins are also known and are exemplified in R. 
Kaufman, Methods in Enzymology 185. 537-566 (1990). As defined herein "operably 
2 0 linked" means that the isolated polynucleotide of the invention and an expression control 
sequence are situated within a vector or cell in such a way that the protein is expressed 
by a host cell which has been transformed (transfected) with the ligated 
polynucleotide/expression control sequence. 

A number of types of cells may act as suitable host cells for expression of the 

2 5 protein. Mammalian host cells include, for example, monkey COS cells, Chinese Hamster 

Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 
cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell 
strains, derived from in vitro culture of primary tissue, primary explants, HeLa cells, 
mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. 

3 0 Alternatively, it may be possible to produce the protein in lower eukaryotes such 

as"^ast~or~iirpfokari70fe"s~^^^ 

Saccharomyces cerevisiae,.Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any 
yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli. Bacillus subtilis. Salmonella typhimurium, or any bacterial 
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strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example. by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 
5 enzymatic methods. 

The protein may also be produced by operably linking the isolated polynucleotide 
of the invention to suitable control sequences in one or more insect expression vectors, 
and employing , an insect expression system. . Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, 
10 e.g., Invitrogen, San Diego, California, U.S.A. (the MaxBac® kit), and such methods are 
well known in the art, as described in Summers and Smith, Texas Agricultural Experiment 
Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." - 

15 The protein of the invention may be prepared by culturing transformed host cells 

.under culture conditions suitable to express the recombinant protein. The. resisting 
expressed protein xcvay then be purified from such culture (i.e., from culture medium or 
cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 

2 0 containing agents which will bind to the protein; one or more column steps over such 
affinity resins as concanavalin A-agarose, heparin-toyopearl® or Cibacrom blue 3GA 
Sepharose®; one or more steps involving hydrophobic interaction chromatography using 
such resins as phenyl ether, butyl ether, ..or propyl ether; or immunoaffinity 
chromatography. 

2 5__ __ . ^Alternatively, the protein of the inveivtion may also be expressed in a form which 

will facilitate purification. For example, it may be expressed as a fusion protein, such as 
those of maltose binding protein (MBP)^ glutathione-S-transferase (GST) or thioredoxin 
(TRX). Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, MA), Pharmacia (Piscataway, NJ) and 

3 0 InVitrogen, respectively. The protein can also be tagged with an epitope and 

subsequently purified by using a specific antibody directed to such epitope. One such 
epitope ("Flag") is commercially available from Kodak (New Haven, CT). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
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methyl or other aliphatic groups, can be employed to further purify the protein. Some or 
all of the foregoing purification steps, in various combinations, can also be employed to 
provide a substantially homogeneous isolated recombinant protein. The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 
5 with the present invention as an "isolated protein." 

The protein of the invention may also be expressed as a product of transgenic 
animals, e^g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

10 The protein may also be produced by known conventional chemical synthesis. 

Methods for constructing the proteins of the present invention by synthetic means are 
known to those skilled in the art. The synthetically-constructed protein sequences, by 
virtue of sharing primary, secondary or tertiary structural and /or conformational 
characteristics with proteins may possess biological properties in common therewith, 

15 including protein activity. Thus, they may be employed as biologically active or 
immunological substitutes for natural, purified proteins in screening of therapeutic 
compounds and in immunological processes for the^development of antibodies. 

The proteins provided herein also include proteins characterized by amino acid 
sequences similar to those of purified proteins but into which modification are naturally 

2 0 provided or deliberately engineered. For example, modifications in the peptide or DNA 
sequences can be made by those skilled in the art using known techniques. Modificatipns 
of interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For 
example, one or more of the cysteine residues may be deleted or replaced with another 

2 5 amino acid to alter the conformation of the molecule. Techniques for such alteration, 

substitution, replacement, insertion or deletion are well known to those skilled in the art 
(see, e.g., U.S. Patent No. 4,518,584). Preferably, such alteration, substitution, replacement, 
insertion or deletion retains the desired activity of the protein. 

Other fragments and derivatives of the sequences of proteins which would be 

3 0 expected to retain protein activity in whole or in part and may thus be useful for screening 

oFoTRS^imirmnologica^ 

given the disclosures herein. Such modifications are believed to be encompassed by the 
present invention. 
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AND H TOT.OGIC AL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit 
one or more of the uses or biological activities (including those associated with assays 
cited herein) identified below. Uses or activities described for proteins of the present 
5 invention may be provided by admirustration or use of such proteins or by administrahon 
or use of polynucleotides encoding such proteins (such as, for example, in gene therapies 
or vectors suitable for introduction of DNA). 

Research Uses and Utilities 

^ 0 The polynucleotides provided by the present invention can be used by the research 

community for various purposes. The polynucleotides can be used to express 
- recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constitutively or at a particular stage of tissue differentiation or development or in disease 

15 states); as molecular weight markers on Southern gels; as chromosome markers or tags 
(when labeled) to identify chromosomes or to map related gene position's; to compare 
with endogenous DNA sequences inpatients to identify potential genetic disorders; as 
probes to hybridize and thus" discover novel, related DNA sequences; as a source of 
information to derive PGR primers for genetic fingerprinting; as a probe to "subtract-out" 

2 0 known sequences in the process of discovering other novel polynucleotides; for selecting 
and making oligomers for attachment to a "gene chip" or other support, including for 
examination of expression patterns; to raise anti-protein antibodies using DNA 
immuiuzation techniques; and as an antigen to raise anti-DNA antibodies or elicit another 
immune respor\se. Where the polynucleotide encodes a protein which binds or potentially 

2 5 binds to another protein (such as, for example,^in a receptor-ligand interaction), the 

polynucleotide can also be used in interaction trap assays (such as, for example, those 
described in Gyuris et al, 1993, Cell 75: 791-803 and in Rossi et a/., 1997, Proc. NatL^Acad, 
■ Scu USA 94: 8405-8410, all of which are incorporated by reference herein) to identify, 
polynucleotides encoding tiie other protein with which binding occurs or to identify 

3 0 inhibitors of the binding interaction. 
— T}^e-proteir^~provicied by the pr 

determine biological activity, including in a panel of multiple proteins for high- 
throughput screening; to raise antibodies or to elicit another immune respor\se; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
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levels of the protein (or its receptor) in biological fluids; as markers for tissues in which • 
the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Where the protein binds or potentially 
5 binds to another protein (such as, for example, in a receptor-ligand interaction), the 
protein can be used to identify the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. Proteins involved in these binding interactions can 
also be used to screen for peptide or small molecule inhibitors or agonists of the binding . 
interaction. 

10 Any or all of these research utilities are capable of being developed into reagent 

grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 

the art. References disclosing such methods include without limitation "Molecular 

Cloning: A Laboratory Manual", 2d ed.. Cold Spring Harbor Laboratory Press, Sambrook, 
15 J., E.F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to 

Molecular Cloning Techniques", Academic Press, Berger, S.L. and A.R. Kimmel eds., 1987. 

Nutritional Uses 

Polynucleotides and proteins of the present invention can also be used as 
2 0 nutritional sources or supplements. Such uses include without limitation use as a protein 
or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a 
source of carbohydrate. In such cases the protein or polynucleotide of the invention can 
be added to the feed of a particular organism or can be administered as a separate solid 
or liquid preparation, such as in the form of powder, pills, solutions, suspensions or 

2 5 capsules. In the case of microorganisms, the protein or polynucleotide of the invention 

can be added to the medium in or on which the microorganism is cultured. 

Cytokine and Cell Proliferation /Differentiation Activity 

A protein of the present invention may exhibit cytokine, cell proliferation (either 

3 0 inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may 
induce-production-of-other-eytokines-in-eertain-eell-populations.-Many-pr 

discovered to date, including all known cytokines, have exhibited activity in one or more 
factor dependent cell proliferation assays, and hence the assays serve as a convenient 
confirmation of cytokine activity. The activity of a protein of the present invention is 
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evidenced by any one of a number of routine factor dependent cell proliferation assays 
for cell lines including, without linnitation, 32D, DA2, DAlG, TIO, B9, B9/11, Ba.F3, 
MC9/a M+ (preB 2E8, RB5, DAI, 123, T1165, HT2, CTLL2, TF-1, Mo7e and CMK. 

5 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 

1 0 Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Fimction 3.1-3.19; Chapter 
7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
BertagnoUi et ah, J. Immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular Immvmology 
133:327-341, 1991; Bertagnolli, et al., J. Immunol. 149:3778-3783, 1992; Bowman et al, J. 
Immunol. 152: 1756-1761, 1994. 

1 5 * Assays for cytokine production and/ or proliferation of spleen cells, lymph node 

cells or thymocytes include, without limitation, those described in: Polyclonal T ceU 
stimulation, Kruisbeek, A.M. and Shevach, E.M. In Current Protocols in Immunology. J.E.e.a. 
Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, . Toronto. 1994; and 
Measurement of mouse and human Interferon y, Schreiber, R.D. In Current Protocols in 

2 0 Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without Limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L.S. and Lipsky, P.E. In Current 
Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, 

2 5 Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al.. Nature 

336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; 
Measurement of mouse and human interleukin 6 - Nordan, R. In Current Protocols in 
Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; 
Smith et al., Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human. 

3 0 Interleukin 11 - Bennett, F., Giannotti, J., Clark, S.C. and Turner, K. J. In Current Protocols 

Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., Clark, S.C. 
and Turner, K.J. In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.13.1, 
John Wiley and Sons, Toronto. 1991. 
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Assays for T-cell clone responses to antigens (which will identify, among others, • 
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. Margulies, 
5 ■ E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience 
(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and 
their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et aL, 
Proc. Natl. Acad. Sci.' USA 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 
11:405-411, 1981; Takai et al., J. Immunol; 137:3494-3500, 1986; Takai et al., J. Immunol. 
10 140:508-512, 1988. 

Immune Stimulating or Suppressing Activity 

A protein of the present invention may also exhibit immune stimulating or 
immune suppressing activity/ including without limitation the activities for which assays . ' 

15 are described herein. A protein may be useful in the treatment of various immune 
deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g., 
in regulating (up or down) growth and proliferation-of T and/ or B lymphocytes, as well 
as effecting the cytolytic activity of NK cells and other cell populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal 

20 . infections, or may result from autoimmune disorders. More specifically, infectious 
diseases causes by viral, bacterial, fungal or other infection may be treatable using a 
protein of the present invention, including infections by HIV, hepatitis viruses, 
herpesviruses, mycobacteria, Leishmania spp., malaria spp.rar^d various fungal infections 
such as candidiasis. Of course, in this regard, a protein of the present invention may also 

2 5 be useful where a boost to the immime system generally may be desirable, i.e., in the 

r " treatment of 7;^cer" ^ ~ 

Autoimmune disorders which may be treated using a protein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 

3 0 Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 
myasthenia-gravis,^graft-versus-host-disease-and-autoimmiane-infl — 

Such a protein of the present invention may also to be useful in the treatment of allergic 
reactions and conditior\s, such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
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example, organ transplantation), may also be treatable using a protein of the present 
invention. 

Using the proteins of the invention it may also be possible to immune responses, ' 
in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
5 immune response already in progress or may involve preventing the induction of an 
immune response. The functions of activated T cells may be inhibited by suppressing T 
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression 
of T cell responses is generally an active, non-antigen-specific, process which requires 
continuous exposure of the T cells to the suppressive agent. Tolerance, which involves 
10 inducing non-responsiveness or anergy in T cells, is distinguishable from 
immunosuppression in that it is generally antigen-specific and persists after exposure to 
the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack 
of a T cell response upon reexposure to specific antigen in the absence of the tolerizing 
agent. ' - 

15 Down regulating or preventing one or more antigen functions (including without 

- limitation B lymphocyte antigen functions (such as , for example, B7)), e.g., preventing 
high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, 
skin and organ transplantation and in graft-versus-host disease (GVHD). For example, 
blockage, of T cell function should result in reduced tissue destruction in tissue 

20 transplantation. Typically^ in tissue transplants, rejection of the transplant is initiated' 
through its recognition as foreign by T cells, followed by an immune reaction that destroys 
the transplant. The administration of a molecule which inhibits or blocks interaction of • 
a B7 lymphocyte antigen with its natural ligand(s) on immune cells (such as a soluble, 
monomeric form of .a peptide having B7-2 activity alone or . in conjtmction with a 

2 5 monomeric form of a peptide having an activity of another B lymphocyte antigen (e.^., B7- 

. 1, B7-3) or blocking antibody), prior to transplantation can lead to the binding of the 
molecule to the natural ligand(s) on the immune cells without transmitting the 
corresponding costimulatory signal. Blocking B lymphocyte antigen hinction in this 
matter prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an 

3 0 immvmosuppressant. Moreover, the lack of costimulation may- also be sufficient to 

anergizelhe T~cells7 thereby^ ihciucing~^toreFance~ih^ 

tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 
administration of these blocking reagents. To achieve sufficient immunosuppression or 
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tolerance in a subject, it may also be necessary to block the function ot a combination ot 
B lymphocyte antigens: 

The efficacy of particular blocking reagents in preventing organ transplant 
rejection or GVHD can be assessed using arumal models that are predictive of efficacy in 
5 humans. Examples of appropriate systems v^hich can'be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been, 
used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 
described in Lenschow et al. Science 257:789-792 (1992) and Turka et uL, Proc. Natl. Acad. 
Sci USA, 89:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., 
10 Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 
determine the effect of blocking B lymphocyte antigen function in vivo on the development 
of that disease. 

Blocking antigen function may also be therapeutically useful for treating 
autoirnmune diseases. Many autoimmune disorders are the result of inappropriate 
1 5 activation of T ceils that are reactive against self tissue and which promote the production 
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the 
activation of autoreactive T - cells may reduce or eliminate disease . symptoms. 
Administration of reagents which block costimulation of T cells by disrupting 
receptor:ligand interactions of B lymphocyte antigens can be used to irlhibit T cell 

2 0 activation and prevent production of autoanhbodies or T cell-derived cytokines which 

may be involved in the disease process. Additionally, blocking reagents may induce 
antigen-specific tolerance of autoreactive T ceils which could lead to long-term relief from 
the disease. The efficacy of blocking reagents in preventing or alleviating autoimmune 
disorders can be determined using a . number of well-characterized animal models of 
25 human autoimmune diseases. Examples include murine experimental autoimmune 
^encephalitis, systemic" lupus ierythmatos^^^^ MRt/7pr/7pr mice or NZB hybrid- mice, 
murine autoinm\ime collagen arthritis, diabetes mellitus in NOD mice and BB rats, and 
murine experimental myasthenia gravis (see Paul ed.. Fundamental Immunology, Raven 
Press, New York, 1989, pp. 840-856). 

3 0 Upregulation of an antigen function (preferably a B lymphocyte antigen function), 
— -as -a -means-of--up-reguiating-immune„resp.ortses,_rnay also be useful in therapy. 

Upregulation of immune responses may be in the form of enhcmdng an existing immime 
response or eliciting an initial immune response. For example, er\hancing an immune 
response through stimulating B lymphocyte antigen function may be useful in cases of 
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viral infection. In addition, systemic viral diseases such as influenza, the common cold, 
and encephalitis might be alleviated by the administration of stimulatory forms of B 
lymphocyte antigens systemically.' 

Alternatively, anti-viral immune responses may be enhanced in an infected patient 
5 by removing T cells from the patient, costimulating the T cells in vitro with viral antigen- 
pulsed APCs either expressing a peptide of the present invention or together with a 
stimulatory form of a soluble peptide of the present invention and reintroducing the in 
vitro activated T cells into the patient. Another method of enhancing anti-viral immune 
responses would be to isolate infected cells from a patient, transfect them with a nucleic 

10 acid encoding a protein of the present invenrion as described herein such that the cells 
express all or a portion of the protein on 'their surface, and reintroduce the transfected 
cells into the patient. The infected cells would now be capable of delivering a 
costimulatory signal to, and thereby activate, T cells in vivo. 

In another application, up regulation or enhancement of antigen function 

15 (preferably B lymphocyte antigen function) may be useful in the induction of tumor 
Lmmimity. Tumor cells (e,^., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, 
carcinoma) transfected with a niicleic acid encoding at least one peptide of the present 
invention can be administered to a subject to overcome tumor-specific tolerance in the 
subject. If desired, the tumor cell can be transfected to express a combination of peptides. 

2 0 For example, tumor cells obtained from a patient can be transfected ex vivo with an 
expression vector directing the expression of a peptide haying B7-2-like activity alone, or 
in conjunction with a peptide having B7-l-like activity and/or B7-3-like activity. The 
transfected tumor cells are returned to the patient to result in expression of the peptides 
on the surface of the transfected cell. Alternatively, gene therapy techniques can be used 

2 5 to target a tumor cell for transfection in vivo. 

The presence of the peptide of the present invention having the activity of a B 
lymphocyte antigen{s) on the surface of the tumor cell provides the necessary 
costimulation signal to T cells to induce a T cell mediated immune response against the 
transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC class 11 

3 0 molecules, or which fail to reexpress sufficient amounts of MHC class I or MHC class 11 

mdlecules7 call'be^ tran¥fected~witK~nuc acid encoding all or a portiOTTof (e.g., a 
cytoplasmic-domain truncated portion) of an MHC class I a chain protein and 
microglobulin protein or an MHC class II a chain protein and an MHC class II P chain 
protein to thereby express MHC class I or MHC class II proteins on the cell surface. 
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Expression of the appropriate class I or class 11 MHC in conjiinction with a peptide having 
the activity of a B lymphocyte antigen {e.g., B7-1, B7-2, B7-3) induces a T cell, mediated 
immune response against the transfected tumor cell. Optionally, a gene encoding an 
antisense construct which blocks expression of an MHC class II associated protein, such 
5 as the invariant chain, can also be cotransfected with a DNA encoding a peptide having 
the activity of a B lymphocyte antigen to promote presentation of tumor associated 
antigens and induce tumor specific immunity. Thus, the induction of a T cell mediated 
immune response in a human subject may be" sufficient to overcome tumor-specific 
tolerance in the subject. 

1 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

. : Suitable assays • for thymocyte or splenocyte cytotoxicity include, without 
limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. 
Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates. 
. 15 . and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1- 
3.19;. Chapter 7, Immimologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. 
-USA 78:2488-2492, 1981; Herrmarm et aL, J. Immunol. 128:1968-1974, 1982; Handa et al., 
J. Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et ah, 
J. Immunol. 140:508-512/ 1988; Herrmann et al., Proc. Natl. Acad, Sci. USA 78:2488-2492, 

2 0 1981; Herrmann et al., J. Immunol. 128:1968-1974; 1982; Handa et al., J, Immunol. 

135:1564-1572, 1985; Takai et aL, J. Immunol. 137:3494-3500, 1986; BowmanefaL, J. 
Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al.. 
Cellular Immunology 133:327-341, 1991; Brown et al., J. ImmunoL 153:3079-3092, 19.94. 
Assays for T-c"ell-dependent immtmoglobulin responses and isotype switching 

2 5 (which will identify, among others, proteins that modulate T-cell dependent antibody 
responses ancl^tKat affect TKl7Th2 profiles)~ihclude, without limitatioh7tHbse describe^^" 
in: Maliszewski, J. ImmunoL 144:3028-3033, 1990; and Assays for B cell function: In vitro 
antibody production, Mond, J.J, and Brunswick, M. In Current Protocols in Immunology, 
J.E:e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

30 ' Mixed lymphocyte reaction (MLR) assays (which will identify, among, others, 

- - — ^proteins that-generate-predonunantIy-Thl~and G include,-without-limita tion, 

those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek,, 
D.H. Margulies, EJvl. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
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7, Immunologic studies in Humans); Takai et al.^ J. Immunol. 137:3494-3500, 1986; Takai 
et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
5 described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al. Journal of 
Experimental Medicine 173:549-559, 1991;;Macatonia et al.. Journal of Immunology 
154:5071-5079, 1995; Porgador et al. Journal of Experimental Medicine 182:255-260, 1995; 
Nair et al.. Journal of Virology 67:4062-4069, 1993; Huang et al.. Science 264:961-965, 
1994; Macatonia et aL, Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj 

10 et al.. Journal of Clinical Investigation 94:797-807, 1994; and Inaba et aL, Journal of 
Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival /apoptosis (which will identify, among others, 
proteins that prevent apoptosis after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 

1 5 . et al.. Cytometry 13:795-808, 1992; Gorczyca et aL, Leukemia 7:659-670, 1993; Gorczyca et 
aL, Cancer Research 53:1945-1951, 1993; Itoh et aL, Cell 66:233-243, 1991; Zacharchuk, 
Joumal of Immunology 145:4037-4045, 1990; Zamai et aL, Cytometry 14:891-897, 1993; 
Gorczyca et aL, International Joumal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and 

2 0 development . include, without limitation, those described in: Antica et aL, Blood 
84:111-117, 1994; Fine et aL, Cellular Immunology 155:111-122, 1994; Galy et aL, Blood 
85:2770-2778, 1995; Toki et aL, Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 

Hematopoiesis Regulating Activity 

'25 A protein of the present invention may be useful in regulation of hematopoiesis 

and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming cells or of factor-dependent cell 
lines indicates involvement in regiilating hematopoiesis, e.g. in supporting the growth and 
proliferation of erythroid progenitor cells alone or in combination with other cytokines, 

30 thereby indicating utility, for example, in treating various anemias or for use in 
conjulictiofrwiH^Tinfadiafi to yfimulafe~tRe~"pYoduction of erythfoid 

precursors and/or erythroid cells; in supporting the growth and proliferation of myeloid 
cells such as .granulocytes and monocytes /macrophages (i.e., traditional CSF activity) 
useful, for example, in conjunction with chemotherapy to prevent or treat consequent 
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myelo-suppression; in supporting the growth and proliferation of megakaryocytes and 
consequently of platelets thereby allowing prevention or treatment of various platelet 
disorders such as thrombocytopenia, and generally for use in place of or complimentary 
to platelet transfusions; and/or in supporting the growth and proliferation of 
5 hematopoietic stem cells which are capable of maturing to any and all of the above- 
mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell 
disorders (such as those usually treated with transplantation, including, without 
limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in 
repopulating the stem. cell compartment post irradiation/ chemotherapy, either in-vivo or 

10 eX'Vivo (i.e., in conjunction with bone marrow transplantation or with peripheral 
progenitor cell transplantation (homologous or heterologous)) as normal cells or 
genetically manipulated for gene therapy. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

15 Suitable assays for proliferation and differentiation of various hematopoietic lines 

are cited above. ' 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic dilFferefitiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et 

2 0 al.. Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al.. Blood 
81:2903-2915,1993. 

Assays for stem cell survival and differentiation (which will identify, among 
others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M.G. In Culture of 

2 5 Hematopoietic Cells. RJ. Freshney, et al, eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, 
NY\ 1994; -Hirayama et al.;TrocrNatI. Aca^^^^ 1992; "Primitive 

hematopoietic colony forming cells with high proliferative potential, McNiece; I.K. and 
Briddell, R.A. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 23-39, 
Wiley-Liss, Inc., New York, NY. 1994; Neben et ah. Experimental Hematology 22:353-359, 

3 0 1994; Cobblestone area forming cell assay, Ploemacher, R.E. In Culture of Hematopoietic 
Cells. R.L_Ereshney,,ei_a;.^eds.y.ol-pp.-l=21,_Wiley--Liss,-Inc..,-New^ 

term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M, and 
. Allen, T. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 163-179, 
Wiley-Liss, Inc., New York, NY. 1994; Long term culture Initiating cell assay, Sutherland, 
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HJ. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss, 
Inc., New York, NY. 1994. 

Tissue Growth Activity 
5 A protein of the present invention also may have utility in compositions used for 

bone,, cartilage, tendon, ligament and/or nerve tissue growth or regeneration, as well as 
for wound healing and tissue repair and replacement, and in the treatment of bums, 
incisions and ulcers. 

A protein of the present invention, which induces cartilage and /or bone growth 
10 in circumstances where bone is not normally formed, has application in the healing of 
bone fractures and cartilage damage or defects in humans and other animals. Such a 
preparation employing a protein of the invention may have prophylactic use in closed as 
well as open fracture reduction and also in the improved fixation of artificial joints. De 
novo bone formation -induced by an osteogenic agent contributes to the repair of 
1 5 congenital, trauma induced, or oncologic resection induced craniofacial defects, and also 
is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal 
disease, and in other tooth repair processes. Such agents may provide an environment 
to attract bone-forming- cells, stimulate growth of bone-forming cells or induce 
2 0 differentiation of progenitors of bone-forming cells. A protein of the invention may also. 
. be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation 
of bone and /or cartilage repair or by blocking inflammation or processes of tissue 
destruction (coUagenase activity, osteoclast activity, etc.) mediated by inflammatory 
. processes. . . 

2 5 Another category of tissue regeneration activity that may be attributable to the 

protein of the present invention is tendon/ligament formation. A protein of the present 
invention, which induces tend on /ligament-like tissue or other tissue formation in. 
circumstances where such tissue is not normally formed, has application in the healing of 
tendon or ligament tears, deformities and other tendon or ligament defects in humans and 

3 0 other animals. Such a preparation employing a tendon/ligament-like tissue inducing 
pfotein may have prophylactic use' in preventing damage to tendon or ligament tissueras 

v^eU* as use in the improved fixation of tendon or ligament to bone or other tissues, and 
in repairing defects to tendon or ligament, tissue. De novo tendon/ligament-like tissue 
formation induced by a composition of the present invention contributes to the repair of 
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congenital, trauma induced, or other tendon or ligament defects of other origin, and is also 
useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The 
compositions of the present invention may provide an environment to attract tendon- or 
ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce 
5 differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of 
tendon/ligament cells or progerutors ex vivo for return in vivo to effect tissue repair. The 
compositions of the invention may also be useful in the treatment of tendinitis, carpal 
tunnel syndrome and other tendon or ligament defects. The compositions may also 
include an appropriate matrix and/ or sequestering agent as a carrier as is well known in 
10 the art. 

The protein of the present invention may also be useful for proliferation of neural 
cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and 
peripheral nervous system diseases and neuropathies, as well as mechanical and 
traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 

15 tissue. More specifically, a protein may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's/ 
Parkinsons disease, Huntington s disease, amyotrophic lateral sclerosis, and Shy-Drager 
syndrome. Further conditions which may be treated in accordance with the present ^ 

2 0 invention include mechanical and traumatic disorders, such as spinal cord disorders, head 
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other rnedical therapies may also be treatable using a protein of the 
invention. 

Proteins of the invention may also be useful to promote better or faster closure of 

2 5 non-healing woimds, including without limitation pressure ulcers, ulcers associated with 
~~~~ Oscular insufficiOTcy^ 

It is expected that a protein of the present invention may also exhibit activity for 
generation or regeneration of other tissues, such as organs (including, for example, 
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) 

3 0 and vascular (including vascular endothelium) tissue, or for promoting the growth of cells 
comprising^such tissues.~Part-of-the^desired-effects-may~be~by4rihibition-or-mod^^^ 
of fibrotic scarring to allow normal tissue to regenerate. A protein of the invention may 
also exhibit angiogenic activity. 
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A protein of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, ' 
and conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful for promoting or inhibiting 
5 differentiation of tissues described above from precursor tissues or cells; or for inhibiting 
the growth of tissues described above. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for tissue generation activity include, without limitation, those described 
10 in: International Patent Publication No. WO95/16035 (bone/ cartilage, tendon); 
International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 
Publication No. W091 /07491-(skin, endothelium ). 

Assays for wound healing activity include, without limitation, those described in: 
Winter, Epidermal Wound Healing, pps: 71-112 (Maibach, HI and Rovee, DT, eds.). Year 
15 Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 
Dermatol 71:382-84 (1978). 

Activin/Inhibin Activity 

A protein of the present invention may also exhibit activin- or inhibin-related 
2 0 activities. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to 
. stimulate the release of follicle stimulating hormone (FSH). Thus, a protein of the present 
invention, alone or in heterodimers with a member of the inhibin a family, may be useful 
as a contraceptive based on the ability of inhibins to decrease fertility in female mammals 

2 5 and decrease spermatogenesis in male mammals. Administration of sufficient amounts 

of other inhibins can induce infertility in these mammals. Alternatively, the protein of the 
invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin- 
P group, may be useful as a fertility inducing therapeutic, based upon the abUity of activin 
molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, 

3 0 United States Patent 4,798,885. A protein of the invention may also be useful for 
~ ad vancementof "theor^etof fertiUtyin^seWally^ mammals, so as To~ina:easethe~ 

lifetime reproductive performance of domestic animals such as cows, sheep and pigs. 

The activity of a protein of the invention may, among other means,.be measured 
by the following methods: 
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Assays for activin/inhibin activity include, without limitation, those described in: * 
Vale et al.. Endocrinology 91:562-572, 1972; Ling et al.. Nature 321:779-782, 1986;Vale et 
aL, Nature 321:776-779, 1986; Mason et al.. Nature 318:659-663, 1985; Forage et aL, Proc. 
Natl. Acad. Sci. USA 83:3091-3095, 1986. 
5 . • 

Chemotactic/Chemokinetic Activity 

A protein of the present invention may have chemotactic or chemokinetic activity 
(e.g., act as a chemokine) for mammalian cells, including, for example, • monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. 
1 0 Chemotactic and chernokinetic proteins can be used to mobilize or attract a desired cell 
population to a desired site of action. Chemotactic or chemokinetic proteins provide 
particular advantages in treatment of wounds and other trauma to tissues, as well as in 
treatment of localized infections. For example, attraction of lymphocytes, monocytes or 
neutrophils to tumors. or sites of infection may result in improved immune responses 
15 against the tumor or infecting agent. . . .. 

A protein or peptide has chemotactic activity for a particular cell population if it 
can stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
movement of cells. Whether a particular protein has chemotactic activity for a population 
2 0 of cells can be readily determined by employing such protein or peptide in any known 
assay for cell chemotaxis. * , 

The activity of a protein of the invention may, among other means^ be measured 
by the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 

2 5 chemotaxis) consist of assays that measure the ability of a protein to induce the migration 

of cells across a membrane as well as the ability of a protein to induce the adhesion of one 
cell population to another cell population. Suitable assays for movement and adhesion 
include, without limitation, those described in: Current Protocols in Immunology, Ed by 
J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W.Strober, Pub. Greene 

3 0 Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and 
~~ "beta Cheim)kines-6.12;^^^^ 



APMIS 103:140-146, 1995; Muller et al Eur. j: Immimol. 25: 1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johr\ston et al. J. of Immunol. 153: 1762-1768, 1994. 
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Hemostatic and Thrombolytic Activity 

A protein of the invention may also exhibit hemostatic or thrombolytic activity. 
As a result, such a protein is expected to be useful in treatment of various coagulation 
disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation 
5 and other hemostatic events in treating wounds resulting from trauma, surgery or other 
causes. A protein of the invention may also be useful for dissolving or inhibiting 
formation of thromboses and for treatment and prevention of conditions resulting 
therefrom (such as, for example, infarction of cardiac and central nervous system vessels 
(e.g., stroke). 

10 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al.. Thrombosis 
Res. 45:413-419, 1987; Humphrey et al.. Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 

15 35:467-474, 1988*. \ ' ' 

Receptor/Ligand Activity ' _ . 

A protein of the present invention may also demonstrate activity as receptors, 
receptor ligands or inhibitors or agonists of receptor/ligand interactions. Examples of 

2 0 such receptors and ligands include, without limitation, cytokine receptors and their 

ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-cell interactions and their ligands (including without limitation, 
cellular adhesion molecules (such as selectins, integrins and their ligands) and 
receptor/ligand pairs involved in antigen presentation, antigen recognition and 
25 development of cellular and humoral immune responses). Receptors and ligands are also 
useful for screening of potential peptide or small molecule inhibitors of the relevant 
. receptor/ligand interaction. A protein of the present invention (including, without 
limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of 
receptor/ligand interactions. 

3 0 The activity of a protein of the invention may, among other means, be measured 
by the following methods: * 

Suitable assays for receptor-ligand activity include without limitation those 
described in:Current Protocols in Immimology, Ed by J.E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach, W.Strober, Pub. Greene Publishing Associates and 
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Wiley-Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static 
conditions 7.28.1-7,28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868^ 1987; 
Bierer et al., J. Exp. Med. 168:1145-1156,- 1988; Rosenstein et aL, J. Exp, Med. 169:149-160 
1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al.. Cell 80:661-670, 
5 . 1995. 

Anti-Inflammatorv Activity - 

Proteins of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 

1 0 the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such- activities can be used to treat 

15 inflammatory conditions including chronic or acute conditions), including without 
limitation inflammation associated with infection (such as septic shock, sepsis or systemic 
inflammatory response syndrome (SIRS)), . ischemia-reperfusion injury, . endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced limg injury, inflammatory bowel disease, Crohn's disease or resulting 

2 0 from over production of cytokines such as TNF or IL-1. Proteins of the invention may also 
be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 

Cadherin/Tumor Invasion Suppressor Activity 

Cadherins are calcium-dependent adhesion molecules that appear to play major 

2 5 roles during development, particularly in defining specific cell types. Loss or alteration 

of normal cadherin expression canTlead to charige's in ceiradResion properties linked td~ 
tumor growth and metastasis. Cadherin malfunction is also implicated in other human 
diseases, such as pemphigus vulgaris and pemphigus fdliaceus (auto-immune blistering 
skin diseases), Crohn's disiease, and some developmental abnormalities, 

3 0 The cadherin superfamily includes well over forty merhbers, each with a distinct 
pattern-of expression- — ^All members-of-the-superfamily-have-in common-Gonserved 

extracellular, repeats (cadherin domains), but structural differences are foimd in other 
parts of the molecule. The cadherin domains bind calcium to form their tertiary structure 
and thus calcium is required to mediate their adhesion. Only a few amino acids in the 
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first cadherin domain provide the basis for homophilic adhesion; modification of this 
recognition site can change the specificit>' of a cadherin so that instead of recognizing only 
itself, the mutant molecule can now also bind to a different cadherin. In addition, some 
cadherins engage in heterophilic adhesion with other cadherins. 
5 E-cadherin, one member of the cadherin superfamily, is expressed in epithelial cell 

types. Pathologically, if E-cadherin expression is lost in a tumor, the malignant cells 
become, invasive and the cancer metastasizes: Transfection of cancer cell lines with 
polynucleotides expressing E-cadherin has reversed cancer-associated changes by 
- returning altered cell shapes to normal, restoring cells' adhesiveness to each other and to 

10 their substrate, decreasing the cell growth rate, and drastically reducing anchorage- 
independent cell growth. Thus, reintroducing E-cadherin expression reverts carcinomas 
to a less advanced stage. It is likely that other . cadherins have the same invasion 
suppressor role in carcinomas derived from other tissue types. Therefore, proteins of the 
present invention with cadherin activity, and polynucleotides of the present invention 

15 encoding such proteins, can be used to treat cancer. Introducing such proteins or 
polynucleotides into cancer cells can reduce or eliminate the cancerous changes observed 
in these cells by providing norinal cadherin expression. 

Cancer cells have also been shown to express cadherins of a different tissue type 
than their origin, thus allowing these cells to invade and metastasize in a different tissue 

2 0 in the body. Proteins of the present invention with cadherin activity, and polynucleotides 

of the present invention encoding such proteins, can be substituted in these cells for the 
inappropriately expressed cadherins, restoring normal cell adhesive properties and 
reducing or eliminating the tendency of the cells to metastasize. 

Additionally,, proteins of the present invention with cadherin activity, and 
25 polynucleotides of the present invention encoding such proteins, can used to generate 
antibodies recognizing and binding to cadherins. Such antibodies can be used to block 
the adhesion of inappropriately expressed tumor-cell cadherins, preventing the cells from 
forming a tumor elsewhere. Such an anti-cadherin antibody can also be used as a marker 
for the grade, pathological type, and prognosis of a cancer, i.e. the more progressed the 

3 0 cancer, the less cadherin expression there will be, and this decrease in cadherin expression 
^ ^can"be detected'by~the'us"e~pf 'a~ca[ahef ' 

Fragments of proteins of the present invention with cadherin activity, preferably 
a; polypeptide comprising a decapeptide of the cadherin recognition site, and poly- 
nucleotides of the present invention encoding such protein fragments, can also be used 
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to block cadherin function by binding to cadherins and preventing them from binding in 
ways that produce undesirable effects. Additionally, fragments of proteins of the present 
invention with cadherin activity, preferably truncated soluble cadherin fragments which 
have been found to be stable in the circulation of cancer patients, and polynucleotides 
5 encoding such protein fragments/ can be used to disturb proper cell-cell adhesion. 

Assays for cadherin adhesive and invasive suppressor activity include, without 
limitation, those described in: Hortsch et al. J Biol Chem 270 (32): 18809-18817, 1995; 
Miyaki et al. Oncogene 11: 2547-2552, 1995; Ozawa et al. Cell 63: 1033-1038, 1990. 

10 Tumor Inhibition Activity - . ■ 

In addition to the activities described above for immunological treatment or 
prevention of tumors, a protein of the invention may exhibit other anti-tumor activities. 
A protein may inhibit tumor growth directly or indirectly (such as, for example, via 
ADCC). A protein may exhibit its tumor irvhibitory activity by acting on tumor tissue or 

15 tumor precursor tissue, by inhibiting formation of tissues necessary to support tumor 
growth (such as, for example, by inhibiting angiogenesis), by causing production of other 
factors, agents or cell types which inhibit tumor growth, or by suppressing, eliminating 
or inhibiting factors, agents or cell types which promote tumor growth, 

20 Other Activities 

A protein of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious 
• agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting 
(suppressing or enhancing) bodily characteristics, including, without limitation, height, 

2 5 weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ 

or body part size oFshape (such asTfor example, breast augmentation or diminution, 
change in bone form or shape); effecting biorhythms or caricadic cycles or rhythms; ^ 
effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 

3 0 carbohydrate, vitamins, minerals, cofactors or other nutritional factors or component(s); 
effecting^behavioralcharacteristies^ including7^without-limitationv-appetite,-H 

cognition (including cognitive disorders)', depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and grow-th of embryonic stem cells fn lineages other than hematopoietic 
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lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies 
of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis); immunoglobulin-like activit}^ (such as, for 
example, the ability to bind antigens or complement); and the ability to act as an antigen 
5 in a vaccine composition to raise an immune response against such protein or another 
material or entity which is cross-reactive with such protein. 

ADMINISTRATION AND DOSING 

10 A protein of the present invention (from whatever soiarce derived, including 

without limitation from recombinant and non-recombinant sources) may be used in a 
pharmaceutical composition when combined with a pharmaceutically acceptable carrier. 
Such a composition may also contain (in addition to protein and a carrier) diluents, fillers, 
salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 

15 "pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of .administration. The pharrnaceutical composition of 
the invention may also contain cytokines, lymphokines, or other hematopoietic factors 
such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, 

2 0 IL-12, IL-13, IL-14, IL-15, IFN, TNFO, TNFl, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem 

. cell factor, and erythropoietin. The pharmaceutical composition may further contain other 
agents which either enhance the activity of the protein or compliment its activity or use 
in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein of the invention, 
25 or to minimize side effects. Conversely, protein of the present invention may be included 
in formulations of The particular cytokine, lymphokine, otlierhematopoietic factor, 
thrombolytic or anti- thrombotic factor, or anti-inflammatory agent to miiumize side effects 
of the cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti-inflammatory agent. 

3 0. A protein of the present invention may be active in multimers (e.g., heterodimers 
— or-homodimers) or complexes-with-itself-or-other proteinsr-As a-resultTT^^ 

compositions of the invention may comprise a protein of the invention in such multimeric 
or complexed form. 
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The pharmaceutical composition of the invention may be in the form of a complex 
of the.protein(s) of present invention along with protein or peptide antigens. The protein 
and /or peptide antigen will deliver a stimulator}'' signal to both B and T lymphocytes. B 
lymphocytes will respond to antigen through their surface immunoglobulin receptor. T 
5 lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins 
including those encoded by class I and class II MHC genes on host cells will serve to 
present the peptide antigen(s) to T lyrhphocytes. The antigen components could also be 
supplied as purified MHC-pepHde complexes alone or with co-stimulatory molecules that 

1 0 can directly signal T cells. Alternatively antibodies able to bind surface immunolgobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the 
invention. ' 

The pharmaceutical composition of the invention may be in the form of a liposome 

15 in which protein of the present invention is combined, in addition to other 
pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 
in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, 

2 0 and the like. Preparation of such liposomal formulations is within the level of skill in the 

art, as disclosed, for example, in U.S. Patent No. 4,235,871; U.S. Patent No. 4,501,728; U.S. 
Patent No, 4,837,028; and U.S. Patent No. 4,737,323, all of which are incorporated herein 
by reference.. > . 

As used herein, the term "therapeutically effective amount" means the total 
25 amount of each active component of the pharmaceutical composition or method that is 
sufficient to show a meaningful patient benefit, i.e., treatment, healing, prevention or 
amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
ingredient, administered alone, the term refers to that ingredient alone. When applied to 

3 0 a combination, the term refers to combined amounts of the active ingredients that result 

in the therapeutic effect,- whether adrninistered iri combination,-seria lly or s^^ 

In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein of the present invention is administered to a 
mammal having a condition to be treated. Protein of the present invention may be 
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administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines 
or other hematopoietic factors. When co-administered with one or more cytokines, 
lymphokines or other hematopoietic factors, protein of the present invention may be 
5 administered either simultaneously with the cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti- thrombotic factors, or sequentially. If 
administered sequentially, the attending physician will decide on the appropriate 
sequence of administering protein of the present invention in combination with 
cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic 

10 factors. ' ^ 

Administration of protein of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. 

15 Intravenous administration to the patient is preferred. 

When a therapeutically effective amount of protein of the present invention is 
administered orally, protein of the present invention will be in the form of a tablet, 
capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical 
composition of the invention may additionally contain a solid carrier such as a gelatin or 

2 0 an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% protein of the 
present invention, and preferably from about 25 to 90% protein of the present invention. 
When administered in liquid form, a liquid carrier such as water, petroleum, oils of animal 
or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils 
^ ^ may be added. The liquid form of the pharmaceutical composition may further contain 

2 5 physiological saline solution, dextrose or other saccharide solution, or glycols such as 

ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid 
form, the pharmaceutical composition contains from about 0.5 to 90% by weight of protein 
of the present invention, and preferably from about 1 to 50% protein of the present 
invention. 

3 0 When a therapeutically effective amount of protein of the present invention is 

administered by mtxavelTo^ 

invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution.. 
The preparation of such parenterally acceptable protein solutions, having due regard to 
pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 
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pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should * 
contain, in addition to protein of the present invention, an isotonic vehicle such as Sodium 
Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The 
5 pharmaceutical composition of the present invention may also contain stabilizers, 
preservatives, buffers, antioxidants, or other additives known to those of skill in the art. 

The amount of protein of the present invention in. the pharmaceutical composition 
of the present inverition will depend upon the nahire and severity of the condition being 
treated, and on the nature of prior treatments which- the patient has undergone. 

10 Ultimately, the attending physician will decide the amount of protein of the present 
invention with which to treat each individual patient. Initially, the attending physician 
will administer low doses of protein of the present invention and observe the patient s 
response. Larger doses of protein of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 

15 increased further. It is contemplated that the various pharmaceutical compositions used 
to practice the method of the present invention should contain about 0.01 pg to about 100 
mg (preferably about O.lng to about 10 mg> more preferably about 0.1 pg to about 1 mg) 
of protein of the present invention per kg body weight. 

The duration of intravenous therapy using the pharmaceutical composition of the 

2 0 present invention will vary, depending on the severity of the disease being treated and 
the condition and potential idiosyncratic response of each individual patient. It is 
contemplated that the duration of each application of the protein of the present invention 
will be in the range of 12 to 24 hours of continuous intravenous administration. 
Ultimately the attending physician will decide on the appropriate duration of intravenous 

2 5 therapy using the pharmaceutical composition of the present invention. 

Protein "of ~th"e""inver\fibn Inay als6~be~ use d~to~imrnm^ze^ "animals^" to" obtain" 
polyclonal and monoclonal antibodies which specifically react with the protein. Such 
antibodies may b,e obtained using either the entire protein or fragments thereof as an 
immunogen. The peptide immunogens additipnally may contain a cysteine residue at the 

3 0 carboxyl terminus, and are conjugated to a hapten such as keyhole limpet hemocyarun 
: (KLH).-Methodsior_synthesizing such.peptides-are-knowninihe-art,^for~example,-^ 

R.P. Merrifield, J. Amer.Chem.Soc. 85, 2149-2154 (1963); J.L. Krstenansky, et al, FEES Lett. 
211, 10 (1987). Monoclonal antibodies binding to the protein of the invention may be 
useful diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
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antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting 
5 and preventing the metastatic spread of the cancerous cells, which may be mediated by 
the protein. 

For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the 
composition topically, systematically, or locally as an implant or device. When 

10 administered, the therapeutic composition for use in this invention is, of course, in a 
pyrogen-free, physiologically acceptable form. Further, the composition may desirably 
be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or 
tissue damage. Topical administration may be. suitable for wound healing and tissue 
repair. Therapeutically useful agents other than a protein of the invention which may also 

15 optioiially be included in the composition as described above, may alternatively or. 
additionally, be administered simultaneously or sequentially with the composition in the 
methods of the .invention. Preferably for bone and/ or cartilage formation, the 
composition would include a matrix capable of delivering the protein-containing 
composition to the site of bone and/or cartilage damage, providing a structure for the 

2 0 developing bone and cartilage and optimally capable of being resorbed into the body. 

Such matrices may be formed of materials presently in use for other implanted medical 
applications. ^ ; 

The choice of matrix material is based on biocompatibility, biodegradability^ 
mechanical properties, cosmetic appearance and interface properties. The particular 
25 application of the compositions will define the appropriate formulation. Potential 
matrices for the compositions may be biodegradable and chemically defined calcium 
sulfate, tricalciiimphosphate, hydroxyapatite, polylactic acid, polyglycolic acid and 
polyanhydrides. Other potential materials are biodegradable and biologically well- 
defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins 

3 0 or extracellular matrix components. Other potential matrices are nonbiodegradable and 

" '"ch'emiaUyTiefinedrTt^^ "aluminate^,~oF otBST 

ceramics. Matrices may be comprised of combinations of any of the above mentioned 
types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalciumphosphate. The bioceramics may be altered in composition, such as in caldum- 
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aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. 

Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and glycolic 
acid in the form of porous particles havirlg diameters ranging from 150 to 800 microns. 
'5 In some applicahons, it will be useful to utilize a sequestering agent, such as 
carboxymethyl cellulose or autologous blood clot, to prevent the protein compositions 
from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 

1 0 ethylcellulose, hydroxyethylcellulose, - hydroxypropylcellulose, hydroxyprppyl- 
methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of 
carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic 
acid, sodium alginate, poly (ethylene glycol), polyoxyethyiene oxide, carboxyvinyl 
polymer and poly(vinyl alcohol). The amoimt of sequestering agent useful herein is 0.5-20 

15 wt%, preferably 1-10 wt% based on total formulation weight, which represents the 
amount necessary to prevent desorbtion of the protein from the polymer matrix and to 
provide appropriate handling of the composition, yet not so much that the progenitor cells 
are prevented from infiltrating the matrix, thereby providing the protein the opportunity 
to assist the osteogenic activity of the progeiiitor cells. 

2 0 In further compositions, proteins of the invention may be combined with other 

agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor 
(EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF-a and 
TGF-P), and insulin-like growth factor (IGF). 

2 5 The therapeutic compositions are also presently valuable for veterinary 
applications. . Particularly domestic ariimals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins of the present invention. 

The dosage regimen of a protein-containing pharmaceutical composition to be 
used in tissue, regeneration will be determined by the attending physician considering 

3 0 various factors which modify the action of the proteins, e.g., amount of tissue weight 
desired to be formed7ti^e site of damagerthe condition of the damaged tissu^^^^^^ 
a wound, type of damaged tissue (e.g., bone), the patient's. age, sex, and diet, the severity 
of any infection, tirne of administration and other clinical factors. The dosage may vary 
with the type of matrix used in the reconstitution and with inclusion of other proteirts in 
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the pharmaceutical composition. For example, the addition of other known growth 
factors, such as IGF I (insulin like growth factor I), to the final composition, may also effect 
the dosage. Progress can be monitored by periodic assessment of tissue/bone growth 
and/or repair, for example. X-rays, histomorphometric determinations and tetracycline 
5 labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, without 
1 0 limitation, in the form of viral vectors or naked DNA). 

Cells may also be cultured ex vivo in the presence of proteins of the present 
invention in order to proliferate or to produce a desired effect on or activity in such cells. 
Treated cells can then be introduced in vivo for therapeutic purposes. 

15 Patent and literature references cited herein are incorporated by reference as if 

fully set forth. - 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION:" 

(i) APPLICANT: Jacobs, Kenneth 
McCoy, John M. 
LaVallie, Edward R. 
Racie, Lisa A. 
Merberg, David 
Treacy, Maurice ^ 
Spaulding, Vikki 
Agostino, Michael J. 

(ii), TITLE OF INVENTION: SECRETED PROTEINS AND POLYNUCLEOTIDES 
ENCODING THEM 

(iii) NUMBER OF SEQUENCES: 34 ^ 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge . 

(D) STATE: MA" 

(E) COUNTRY: U.S.A. . 

(F) ZIP: 02140 

(v) COMPUTER READABLE ..-FORM : • 

. (A) MEDIUM TYPE:" Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1. 0, Version #l'. 30 

(vi) CURRENT APPLICATION DATA:- 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: . 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Sprunger , Suzanne A. 
. (B) REGISTRATION NUMBER: 41,323 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 498-8284 

(B) TELEFAX: .(617) 876-5851 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 1521 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : double 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SSQ ID NO : 1 : 

GTAACCTTCT TCTGCGCGGC TGCAGCTCGG GACTTCGGCC TGACCCAGCC CCCATGGCTT 60 

* CAGAAGAGCT ACAGAAAGAT CTAGAAGAGG TAAAGGTGTT GCTGGAAAAG GCTACTAGGA 12 0 
AAAGAGTACG TGATGCCCTT ACAGCTGAAA AATCCAAGAT TGAGACAGAA ATCAAGAACA 180 
AGATGCAACA GAAATCACAG AAGAAAGCAG AACTTCTTGA TAATGAAAAA CCAGCTGCTG 24 0 
TGGTTGCTCC CATAACAACG GGCTATACGG TGAAAATCAG TAATTATGGA TGGGATCAGT 3 00 
CAGATAAGTT TGTGAAAATC TACATTACCT TAACTGGAGT TCATCAAGTT CCCACTGAGA 3 60 
ATGTGCAGGT GCATTTCACA GAGAGGTGAT TTGATCTTTT GGTAAAGAAT CTAAATGGGA 42 0 
AGAGTTACTC CATGATTGTG AACAATCTCT TGAAACCCAT CTCTGTGGAA GGCAGTTCAA 480 

• AAAAAGTCAA GACTGATACA GTTCTTATAT TGTGTAGAAA GAAAGTGGAA AACACAAGGT 54 0 
GGGATTACCT GACCCAGGTT GAAAAGGAGT GCAAAGAAAA AGAGAAGCCC TCCTATGACA 600 
CTGAAACAGA . TCCTAGTGAG . GGATTGATGA ATGTTCTAAA GAAAATTTAT GAAGATGGAG 66 0 
ACGATGATAT GAAGCGAACC ATTAATAAAG CCTGGGTGGA ATCAAGAGAG AAGCAAGCCA 72 0 
AAGGAGACAC GGAATTTTGA GACTTTAAAG TCGTTTTGGG AACTGTGATG TGATGTGGAA 780 
ATACTGATGT TTCCAGTAAG GGAATATTGG TGAGCTGCAT ATATAAATTT GACAGATAGC 84 0 
TATTTACATA GCCTTCTAAG TAAAGGCAAT GAATTCTCCA TTTCCTACTG GAGGATTTAT • 900 
TTAAATAAAA TATGCTTATT AAACACTCCT GCAAAGATGG TTTTATTAGT ACCCTGGTCA 960 
TTTTGTTCAA GGAAGGGTTA TATTGCATTC TCACGTGAAA TATAAAAAGC AAGTCTTGCC . . . 102 0 
CAATAAAAAC GCTACATTGT GTGTATTTTT TGTTCAGCTA AGAATTGGAA AAGTATTTGC 1080 
TTGCCTTTTA AGTTACTGAC ATCAGCTTCC ACCAGTGTAA AAATTGAGTA AAACCTGAAG 1140 
TTTTGCATAA AATGCAAATC GGTGCCTGTG CTTGAAGGTT GCTGTAGAGC ATCTGACCCC 1200 
TTATTACCAC CTTAAGCAAT GTATATGCCA TGCATTACCA TGCACTAATT CAATCACAGG 12 60 

TGTTTCTATC-TAGATTTAAA-TATATTTGTC^AATGAATGTG-GAATAGAAAA- 

ACAATAATAG ACATATCTTT GTATGGTACC AGTTAGTTTT GCCGTGGATC AGATGGTTTA 1380 

TAAAAGTAAT AACCATAAAG CAAAAAATAA TTTGAAAGCC CGTCTATTCC TATGCTCAAT 144 0 
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Ai^AGTTAAGT TTTTTTTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1500 
ajW^AAAAAA AAAAAAAAAA A - 1521 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear • . . 
(ii) MOLECULE TYPE :" protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ala Ser Glu Glu Leu Gin Lys Asp Leu Glu Glu Val Lys Val Leu 
1 5 10 15' 

'Leu Glu' Lys Ala Thr Arg Lys Arg Val Arg .Asp Ala Leu Thr Ala Glu 
20 25 30 

Lys Ser Lys lie Glu Thr Glu lie Lys Asn Lys Met Gin Gin Lys Ser 
35 40 .._ 45 

Gin Lys Lys Ala Glu Leu Leu Asp Asn Glu Lys Pro Ala Ala Val Val 
50 55-60 

Ala 'Pro lie. Thr Thr Gly Tyr Thr Val Lys lie Ser Asn Tyr Gly Trp 
■65 . 70 • 75 80 

Asp Gin Ser Asp Lys Phe Val Lys lie Tyr lie Thr Leu Thr Gly Val 
85 90 95 

His Gin Val Pro Thr Glu Asn Val Gin Val His Phe Thr Glu Arg Ser 
100 . . 105 110 

Phe Asp Leu Leu Val Lys Ash Leu Asn Gly Lys Ser Tyr Ser Met lie 

^"^ 115 " " 120 125 ' 



Val Asn Asn Leu- Leu Lys Pro lie Ser Val Glu Gly Ser Ser Lys Lys 

130 135 140 ' 

Val Lys Thr Asp Thr Val Leu lie Leu Cys Arg Lys Lys Val Glu Asn 

145 150 155 160 

. _ _Thr„ Ar g _Tr p_ Asp 'Tyr L eu_Thrl G_ln_Vai G lu Ijy_s _G1 u_ C ys L ys Glu L ys 

165 ' ' 170 175 

Glu Lys. Pro Ser Tyr Asp Thr Glu Thr Asp Pro Ser Glu Gly Leu Met 

180 185 • 190 
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Asn Val Leu Lys Lys Ile.Tyr Glu Asp Giy Asp Asp A.sp Met Lys Arg 
195 200 205 

Thr lie Asn Lys Ala Trp Val Glu Ser Arg Glu Lys Gin Ala Lys Gly ' 
210 215 220 

Asp Thr Glu Phe 
22 5 - 

( 2 )' INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

■ (A) LENGTH: 13-94 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TGCGTCATGC AGTGCGCCGG AGGAACTGTG CTCTTTGAGG CCGACGCTAG GGGCCCGGAA 60 

GGGAAACTGC GAGGCGAAGG TGACCGGGGA CCGAGCATTT -CAGATCTGCT CGGTAGACCT 12 0 

GGTGCACCAC CACCATGTTG GCTGCAAGGC TGGTGTGTCT CCGGACACTA CCTTCTAGGG 180 

TTTTCCACCC AGCTTTCACC AAGGCCTCCC CTGTTGTGAA GAATTCCATC ACGAAGAATC 240 

AATGGCTGTT AACACCTAGC AGGGAATATG CCACCAAAAC ' AAGAATTGGG ATCCGGCGTG 3 00 

GGAGAACTGG CCAAGAACTC AAAGAGGCAG CATTGGAACC ATCGATGGAA AAAATATTTA .3 60 

AAATTGATCA GATGGGAAGA TGGTTTGTTG CTGGAGGGGC TGCTGTTGGT CTTGGAGCAT 420 

TGTGCTACTA TGGCTTGGGA CTGTCTAATG AGATTGGAGC TATTGAAAAG GCTGTAATTT 480 

GGCCTCAGTA TCTHTAAGGAT AGAATTCATT -CCACCTATAT GTACTTAGCA GGGAGTATTG 5 4.0. 

GTTTAACAGC TTTGTCTGCC ATAGCAATCA GCAGAACGCC TGTTCTCATG AACTTCATGA ; 600 

TGAGAGGCTC TTGGGTGAGA ATTGGTGTGA CCTTTGCAGC CATGGTTGGA GCTGGAATGC 660 

TGGTACGATC AATACCATAT GACCAGAGCC CAGGCCCAAA GCATCTTGCT TGGTTGCTAC ' 720 

ATTCTGGTGT GATGGGTGCA GTGGTGGCTC CTCTGACAAT ATTAGGGGGT CCTCTTCTCA 7 80 

TCAGAGCTGC ATGGTACACA GCTGGCATTG TGGGAGGCCT CTCCACTGTG GCCATGTGTG 840 

CGCCCAGTGA AAAGTTTCTG AACATGGGTG CACCCCTGGG AGTGGGCCTG GGTCTCGTCT 900 

TTGTGTCCTC ATTGGGATCT ATGTTTCTTC CACCTACCAC CGTGGCTGGT" GCCACTCTTT 960. 
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ACTCAGTGGC AATGTACGGT G.GATTAGTTC TTTTCAGCAT GTTCCTTCTG TATGATACCC 1020 

AGAAAGTAAT CAAGCGTGCA GAAGTATCAC CAATGTATGG AGTTCAAAAA TATGATCCCA 1080 

TTAACTCGAT GCTGAGTATC TACATGGATA CATTAAATAT ATTTATGCGA GTTGCAACTA 1140 

TGCTGGCAAC TGGAGGCAAC AGAAAGAAAT GAJVGTGACTC AGCTTCTGGC TTCTCTGCTA 1200 

CATCAAATAT CTTGTTTAAT GGGGCAGATA TGCATTAAAT AGTTTGTACA AGCAGCTTTC 1260 

GTTGAAGTTT AGAAGATAAG AAACATGTCA TCATATTTAA ATGTTCCGGT AATGTGATGC .13 2 0 

CTCAGGTCTG CCTTTTTTTC TGGAGAATAA ATGCAGTAAT CCTCTCCCAA ATAAGCACAA 13 80 
AAAAAAAAAA AAAA ^ 1394 
(2) INFORMATION FOR SEQ ID NO : 4 : ' ^ , 

(i). SEQUENCE CKARACTERISTICS.: 

(A) LENGTH: 3 45 amino acids 

(B) TYPE: amino, acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: , linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Leu Ala Ala Arg Leu Val Cys Leu Arg Thr Leu Pro Ser Arg Val 

1 . ' - 5 " ' . ■ .10 15 

Phe His Pro Ala Phe Thr Lys Ala Ser Pro Val Val Lys Asn Ser lie 
20. 25 30 

Thr Lys Asn Gin Trp Leu Leu Thr Pro Ser Arg Glu Tyr Ala Thr Lys " 
35 40 ■ ^ 45 

Thr Arg lie Gly lie Arg Arg Gly Arg Thr Gly Gin Glu Leu Lys Glu 

" 5^ : : 55 " : eo ^ 



Ala Ala Leu Glu Pro Ser Met Glu Lys lie Phe Lys lie Asp Gin Met 
65 70. 75 80 

Gly Arg Trp Phe Val Ala Gly Gly Ala Ala Val Gly Leu Gly Ala Leu 
85 . 90 95 

_?ys_ Tyr Tyr Gl y Leu Gl y L eu Ser Asn Glu lie G l y Ala lie Glu Lys 
100 ' 105 110 

Ala Val lie Trp Pro Gin Tyr Val Lys Asp Arg; lie His Ser Thr Tyr 
115. 120 125 
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Met Tyr Leu Ala Gly Ser lie Gly Leu Thr Ala Leu Ser Ala lie AJLa 
130 135 140 

lie Ser Arg Thr Pro Val Leu Met Asn Phe Met Met Arg Gly Ser Trp 
145 150 155 160 

Val Thr lie Gly Val Thr Phe Ala Ala Met Val Gly Ala Gly Met Leu 
165 170' 175 

Val Arg Ser lie Pro Tyr Asp Gin Ser Pro Gly Pro Lys His Leu Ala 
180 ^ 185 190 

Trp Leu Leu His Ser Gly Val Met Gly Ala Val Val Ala Pro Leu Thr 
195 ' 200 205 

lie Leu Gly Gly_ Pro Leu Leu lie Arg Ala Ala Trp Tyr Thr Ala Gly 
210 215 220 

He Val Gly -Gly Leu Ser Thr Val Ala Met Cys Ala Pro Ser Glu Lys 
225 230 235 240 

Phe. Leu Asn Met Gly Ala Pro Leu Gly Val Gly Leu Gly Leu Val Phe 
245 ^ 250 255 

Val Ser Ser Leu Gly Ser Met Phe'. Leu Pro Pro Thr Thr Val Ala Gly 
260 .265 . 270 

Ala Thr Le'u Tyr. Ser Val Ala Met Tyr Gly Gly Leu Val Leu ' Phe Ser 
275 280 285 

Met Phe Leu Leu Tyr Asp Thr Gin Lys, Val. He Lys Arg Ala Glu Val 
290 295- 300 

Ser Pro Met Tyr Gly Val Gin Lys Tyr Asp Pro He Asn Ser Met Leu 
305 310 315 320 

Ser He Tyr Met Asp Thr Leu Asn He Phe Met Arg Val Ala Thr Met 
325 330 335 

-Leu Ala Thr Gly Gly Asn Arg Lys Lys 

__. 340 ; 345 - _ ■_ . 

(2) INFORMATION FOR SEQ ID NO: 5: * " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 0 8 base pairs 

(B) TYPE: nucleic acid 
(G) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(ii) . MOLECULE TYPE:: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GCTTTTTTTT TTTTTTTTGG TTGAGATGGG GTCTCGCCAT GTTTCCCACA CTGATCTTGA ,60 

ACTCCTGGGC TCCAGGAATT CTCCTACTTT GGCCTCCCAA AGTGTTGGGA ATATTGGCAT 120 

. GAACCACAGC ACCTGACTTG CATATTTGTG AATTCCCCAA ATTGCTTTTT TTAAATTGAT .180 

TTCTAATTTC ATTTCATTGT TATGGGGAAC ATACTTTGTA TGGTTTCAAT GTTTTAAAAT 240 

TAATTGAGAC TTGTTTTATG ACTTAGCATA TGGTCTGTGT TGAAGAAGGC TCCATGTACA 3 00 

CTTGAGAATA ATATGTATAC TGTGGTTGTT GGGTGGATTT TCTATGTATG- TTTARGTGAT 3 60 

ATGGTTTTAT AGTGTTGTTT' AARTCTTCTA TTTTCTTCTT TTTCTGCCCA GTTTTATTTT 420 

TGAAAGCATA CTGARGTCTC CAACTCARTG CCTTAGCCTC CTGAGCAGTT GGGACTACAG 4 80 

GCATACGCCA CTACACCCAG CAATTTTTTT GTATTTTTCT GTAGAGACAG AGTTTCACCA 540' 

TGTTGCCTAG GCTGGTCTCA GATTCCTGGA CTCAAGTGAT CTCGATTCCC GGCCTCTGCC 600- 

TCCCGGGGTG CTGGGATTGC AGGCATGAGC TACTATGCCT- GGCAAATTTT ATTTTTCCTT 660 

TTATTTTGTC ACATAATTAA^ AGCTACTCCA GAATTCCCTT GATTTCTGCT TGCCTGGTAT 720 

ATCTTTTTTC CATTTTTTAA CTGTCAGCCT TTTTTGTGCC TGTTAATCTA AAGTATGTGT 7 80 

-TTCGTAGATA ATATGTAGCT GGATCATATT TTAAAAATAT TTATTCTGCC AAGCTCTGTC 840. 

TTTTGATTGG AGTATTCTTT CATTTATGTT TGTAATTACT GATGAGGGGG GCACTAATGT 900 

CTGCTGTTTT GCTATTTGTT TCCCCATGTC TTATGTCTTC ATTACTGACT TTTTTATTAA 9 60 

ACAACTATTT TCTTGGGTAC CATTTTAAGT CCCTCTCCCA CTCATTTTTT AATGTTTTTT .1020 

TGTGTTTACT TTTGTTTTTA TTGTTTGCCC TGATATTAAA ATTAACATTT TACCTTGAAA 1080 

TAGTTGGCTT CAGATTAATA TCAACTTAGT TTCAATAGCA* TAGGAAATTT GCTTCACTAT 1140 

ATTTCCATTT TCTCCCCGTC CTTTGTGCTA TTATTACTAT ACCAATTAGA TCTCTACACA 1200 

ATATAGGCAT ATCAA.CACAT TTTGTAATTA TTTCCTTATC CAGTTGTCTT TTAATATAGA 12 60 

TCTGTGAAGA AAAGTATTAC ACAAATAGAT CTATTCTGTT TTTTATAATT ATTTAATTAC 13 20 

CTTTGGTGGT GCTGTTTATT TTTCATGCAT TTGAGTTACT GTCTAGTATT CATTCATTTC 13 80 

TCTCTGAATC ACTCCCTTTA GTATTGCTTG TAGGGCAGGT CTGCTAGCAT TGAATTCTTT 1440 

TAATTTTTGT GACTCTGCAA ATGCCATAAT TTCTCTTTTG- TTTGTGAAGG ATAGTTOTGC 1500^ 

TAGATACAGA ATTTGCAGTT GGCATTCTTT TTACTTTAGC AGTTTGAAAA TATTTCCCAT 1560 

TGTTGGCCGG GCACAGTGGC TCACGCCTCT GGTCCTAGCA CTTTGGGAGG CCGAGGCGGG 1620 
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CGGATCGTCT GGGGTCGGGA GTTCGGGACC GGCCTGGCCA ■ ATATGGTGAG GCCCTGTTTC 1680 

TGCTAAAATA TAAAAATTGG CTGGGCATGA TGGCGGGTGC CTCTAGTCCC AGCTGCTCGG 17 4 0 

GAGGCTGAGG TGGGGGAGTC GCTTGAGCCC GGGAGATGAT GGCTGTGGTG. AGCCGGGATG 1800 

GCGCCGCTGC ACTCCGGCCT GGGCGGCTGA GTGAGACTCC ATCCCCGAAA AAAAAAAAAA 1860 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 1908' 
(2) INFORMATION FOR SEQ ID NO : 6 : ' ' ' 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 7 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) .MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ. ID N0:6.:- 

Met Arg Gly Ala Leu Met Ser Ala Val Leu Leu Phe Val Ser Pro Cys 

1 - 5 • , . 10 15 

'Leu Met Ser Ser Leu Leu Thr Phe Leu Leu Asn Asn Tyr Phe Leu Gly 

' ' " 20 ' 25 30 

■ Tyr His Phe Lys Ser, Leu Ser His Ser Phe Phe Asn Val Phe Leu Cys 
35 ^ 40 45 

Leu Leu Leu Phe Leu Leu Phe Ala, Leu lie Leu Lys Leu Thr Phe' Tyr 
,50 55 . 60 

. ,Leu Glu lie Val Gly Phe' Arg Leu lie Ser Thr 
65 -. 70 75 

. (~2 ) INFORMATION FOR-SEQ_ID NO : 7 :_ _ . _ __ . 

■ ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
-(D) TOPOLOGY: linear 

(ii)- MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 
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CTTTTTTTTT TTTTTTCAAT TTCATTTAGT TCTGCCCTGA TCTTGGTTAT TTCCTTTTTT 60 

TCTGCTGGGT TTGGGTTTGG TTTGTTCTTA TTTCTCTAGT TCCTTGAGGT GTGACCTTAR 120 

AATGTCAATT TGTGCTCTTT CAATCTTTTT GATGTAGGCG TTGAGGGCTG TGGACTTTTC 130 

TCTTGGCACT CCCTTTGGTG TATCCCARAG GTTTTGATAG GTTGTGTCAT TATTGCAATT 240 

CAGTTTGAAG AATTTCTTAA TCTCCACCTT GATTTTGTTT TTGACCCAAT GCTCATTCAG 3 00 

GAGCAGGTTA TTTACTTTCC ATGTACTTGC ATGGCTTTGA AGCTTCCTTT TGGAGTTGAT 3 60 

TTCCAGTTTT ATTCCACTGT GATTTGAGAG AGTGCTTTAC ATAATTTCAA TTTTCTTAAT 420 

TTTATTAAGG CTCGTTTTAT GGCCTATAAT ATGGTCTATC TTGGAGAAAG TTCCATGCAC 480 

TGTAGAATAG AATGTGTATT CTGTGGTTGT TGGATGAAAT GTTCTGCATA TATTCCTAGA 540 

TTGCCTCCCC ACAAAAGGTT GCATCAATGT CTGTGTTTCT CTACACCATC TCACCCTTGC 600 

CAACTTCGGG TTTCATCAGA CCTTACTGAT -.TGtCAGTATG ATCTGTGAAA CAAATCTCTC " 660 

AGTTTTGATT TGCATTTT-TT AAATTATGAG AGCTTGAACA CCATTTTACA TGTTTATTGG 72 0 

■CTGTTGTTAT TTCCTTTTTG AGATCTGTTC GTTATATGCT TTGCCCGTTT TTCTGTTGGG 7 80 

TGGTTATTAT TTTTCTTATT GAATGGTATA AGCTCTTTGT AAGTTAAGGA CATTAGCCCT 840 

TAGTCAGATA TTTTGACTTA GGTTTTAATT TTTTTCCACA CAGAAGTTTT AAGCTCTGTG " 9 00 

GCAAATTTAT CAGTCTTATA TCACTACAGG GTTATAAATA TTAGYTATCA CTTCGGGTTT - 9 60 

GTGTCTTGCT TAGAAAGCMT CATTTGAAGA TTGTAAATGT TAGTAAGTTT CCCCATATTT 102 0 

TCCTCTAGGA CTTCCATGGT TTAATTTGTT TTGTTTAAAY TAGGAATTGG CATTCACATC 1080 

CTYTTTTGTC CCAGGTCTCA GAGGTCCCTT GTATCTTATA GAGCAGTATT GTTTTATGTT 1140 

ATTTTCCCAT GTATAATTTA AAAACAAAAT . ACGTTGTTCA AAACAAAATA CAGTGGCAGC 120 0 

AGATAATGGC AGTATCTCTG TAACTGCTGG TAAAGTGTAT TTCATAGTGA AGTGTTCATA 12 6 0 

AACTAAAGAG TCATTGATTT GGTTTCCTGG CTAATTAAAA TCTGAATTCC ATTTGAAGTT 132 0 

CCATTGAAAT CATGGTTTTA CTCTATAGCA GTGGATGTTT TTTCCCAACC TTTCTGATAT 13 80 

TTTTTTCCTT CCTGAGACAG GGTCTTGCTC TGTCACCTGG GATGGAGTGT AGTTGCACCA 1440 

TCAAGGCTTA CTGCAGTCTC AACTCTCTGA GCTCAAGTGA TCCTGCCACC TCAGCCTCTT 1500- 

^ACTAGCAAG^GATTaSvGGC ^iWtTTGfAG ^~r5'6 0" 

AGATGGATTC TCACTATGTT GCCCGGGCTC ATCTTGAACT CGAGCTCAAG CAATCTGTCC .162 0'^ 

ATCTTGGCCT CCCAAAGTGC TGGGATTATA GGCGTGAGCC ACTGCACCTG GCCCCTTTCT 1680 
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GATTATTTTA ATCTATCTTT AAATGTTCAA AGTGATTTGC 
TTAGTTTTTT TTAAATTGAG TGTATTTTAT CTAGATATTT 
TTGGATTTTA TAAATACATC TAAATTTGTT ATTTCCAGAA 
TAGCCTCGTG CTTTGTTATT GTTAGGTTTT TTTTTTTTGT 
TATCTGGAGT GCAGTGGCAC AGTCATAGCT CACTGTACCC 
AATCCTCCCA TCTCAGCCTC CTGAGTAGTT GGGACCACAG 
CTAATTTTTT- AAATTTTGTT CTTAATAGAG ACAGAGTCTT 
CTCAAACTCC TGGGCTCAAG -CGATCCTCCC ACCTCAGCCT 
GATGTGAATC ATTACACCCA GCGTATTAAT GGTTTTGTAT 
TGGAAAGATG AAGTGCTGTG AAATATTGTA GGAGCAGAAA 
ACATGGGCAA AATTTCTGTC TCTTTTCTTA TTTTTGCTTT 
TTGGGAAAGC TACTGATCGG ATGGATCCTT TCAGGAAAGC 
ATTTGCATTA TATAGAACGA TATGAAGACC ATACAATATT 
TTAAAAGGAC GCATATCAAG ATGAAGAAAC AACCCAAAGG- 
GGGCCATCAT CACCATCTGC CGGCTCATTG GCATCAAAGA 
GGTCCATTAA TATGCTCAGC CTCACCCAGG GCCTCTTCCG 
CCCATCAACA GCTGGCTGAT AAGAAGGGCC TCCATGTTGT 
GCCCTCTGCC CATTGTGGTT GCGTCCCCCC GGGGGCCCTT 
AAGATGAGGT TCCAGACGTC AAACTGGACT GGGAAGATGT 
AGCGCTCTGT GTGGTCTAAT TTGAAGAGAG CCGCCACGTA 

CCAGTTCCTG. TGCTGCCCTG CACCTAGGAG AGACTCAGCC 

CCTTGCCTTT TGTTTGTTTT GAGGGAAGTT TAATCTTTAA 
TATAGCTTTC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAA ' 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



PCT/US98/05972 

CTAATTCATT TAAAGCATAT 174 0 

TTAAAAGGCA GCATCTAACC 1800 

TGCTTCAAAA CAGATCTCTG 1860 

TTTGAGACAG GGTCTTGCTC 192 0 

TCAAACTCCT AAACTCAAGT 198 0 

TCATGCACCA GCATGCCTGG 2 04 0 

GCTGTGTTGT TCAGGCTGGT 210 0 

CCTAAAGTGC ' TGAGATTACG 2 16 0 

AGCAAGTCTT TTGTGGGTGG 222 0 

CTTGAAATGT GGCAAAAACC 22 8 0 

TTTGTTTAAA GGTTTTTCTA 23 4 0 

AAAGAACAGA GCAGTTCACC 2 400 

CCATGATATT TCATTAAGAT 246 0 

-TTACGGCCTC CGCTGCC ACA 2 52 0 

CATGTATGCC AAGGTCTCTG 258 0 

TGGGCTCTCC AGACAGGAAA 2 640 

GGAAATCCGG GAGGAATGTG • 2700 

GAGGAAGGAT CCAGAGCCAG 2760 

GAAGACTGCA CAGGGAATGA 2 82 0 

ACCTCTCTGG CCTTGTGCAG 2 880 

CCTCACAGCT TGGGATGTTA ^940 

ACTCTTTGGA AATAAATAAT 3 000 

AAAAAAAAAA AAAAAAAAAA 3060 

*3 0'7 6 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Gly Lys lie Ser Val Ser Phe Leu lie Phe Ala Phe Leu Phe' Lys 
1 5 10 15 

Gly Phe Ser lie Gly Lys Ala. Thr Asp Arg Met Asp Ala Phe Arg Lys 
20 25' 30 

Ala Lys Asn Arg Ala Val His His Leu His Tyr lie Glu Arg Tyr Glu 
35 40 45 

Asp His Thr lie Phe His Asp lie Ser Leu Arg Phe Lys Arg Thr His 
'50 - 55 60 

lie- Lys Met Lys Lys Gin Pro Lys Gly Tyr. Gly Leu Arg Cys His Arg 
65 • -70 75 80 

.Ala lie lie Thr lie Cys Arg Leu lie Gly lie .Lys Asp Met Tyr Ala 
85 90 . 9'5 

Lys Val Ser Gly Ser lie Asn Met Leu Ser Leu Thr Gin Gly Leu Phe 
' . 100 105 110 

Arg Gly Leu Ser Arg Gin Glu Thr His Gin Gin Leu Ala Asp Lys Lys 
115 120 ' 125 

Gly Leu His Val Val Glu lie Arg Glu Glu Cys" Gly Pro Leu Pro lie 
130 135 140 

Val Val Ala Ser Pro Arg Gly Pro Leu Arg Lys Asp Pro Glii Pro Glu 
145 ' 150 155 160 

Asp . Glu Val Pro Asp Val Lys Leu Asp Trp Glu Asp Val Lys Thr Ala 
165 170 175 

Gin Gly Met Lys Arg Ser Val Trp Ser Asn Leu Lys Arg Ala Ala Thr 
-180 - -185 - 190 - — 



(2) INFORMATION FOR SEQ ID NO : 9 : 

( i ) SEQUENCE '.CHARACTERISTICS : 

(A) LENGTH: 683 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
~(D)~T0POLlD'GYT"riMar ] 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 9: 
CGCCAAGTGC GCATGGGGAC GCTATAGCAA TTCGTTTGCT GTCCTTCCTC TCCTTCGAAG 60 

ATGACAAGGC CTACCATCGT TTCTTCCTGC CTTTGGGCCG TCAGGCAGTT GGTTGGGACC 120 

CGCTCCAACC CTCGGTTCTT CCTGCAATAC AGTGGATACA ATTTGTCATG GCTACTCTGA 180 

GTGTTATAGG TTCAAGTTCA CTTATTGCCT ATGCTGTATT CCATAATATA CAGAAATCTC 240 

CAGAGATAAG ACCACTTTTT TATCTGAGCT TCTGTGACCT GCTCCTGGGA CTTTGCTGGC 3 00 

TCACGGAGAC ACTTCTCTAT GGAGCTTCAG TAGCAJVATAA GGACATCATC TGCTATAACC 3 60 

TACAAGCAGT TGGACAGATA TTCTACATTT CCTCATTTCT CTACACCGTC AATTACATCT 420 
GGTATTTGTA CACAGAGCTG AGGATGAAAC ACACCCAGAG TGGACAGAGC ACATCTCCAC ' 480 

TGGTGATAGA TTATACTTGT CGAGTTGGTC AAATGGCCTT TGTTTTCTCA AGCCTGATAC 540 

CTCTGCTATT GATGACACCT GTATTCTGTC TGGGAAATAC TAGTGAATGT TTCCAAAACT 600 

TCAGTCAGAG CCACAAGTGT ATCTTGATGC ACTCACCACC ATCAGCCATG GCTGAACTTC 660 

CACCTTCTGC CAACACATCT GTC 683 
(2) INFORMATION -FOR SEQ . ID -NO : la: / 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear _ " - 
(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Ala Thr Leu Ser Val lie Gly Ser Ser Ser Leu lie Ala Tyr Ala 

1 ,5 10 15 

Val Phe His Asn lie Gin Lys Ser Pro GIu lie Arg Pro Leu Phe Tyr 

20 25 30 

Leu Ser Phe Cys Asp Leu Leu Leu Gly Leu Cys Trp Leu Thr Glu Thr 

35 40 45 



Leu Leu Tyr Gly Ala Ser Val Ala Asn Lys Asp lie lie Cys Tyr Asn 
50- 55 60 ' 

Leu Gin Ala Val Gly Gin lie Phe Tyr lie ^er Ser Phe Leu Tyr Thr 
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65 ■ 70 75 80 

Val Asn Tyr lie Trp Tyr Leu Tyr Thr Glu Leu Arg Met Lys Kis Thr 
85 90 , 95 • ' 

Gin Ser Gly Gin Ser Thr Ser Pro Leu Val lie Asp Tyr Thr Cys Arg 
100 105 . ' 110 

Val Gly Gin Met Ala Phe Val Phe Ser Ser Leu lie Pro Leu Leu Leu 
115 120 125 

Met Thr Pro Val Phe Cys Leu Gly Asn Thr Ser Glu Cys Phe Gin Asn 
130 - 135 140 

Phe Ser Gin Ser His Lys Cys lie Leu Met His Ser Pro Pro Ser Ala 
145 ' 150 155 ^ 160 

Met Ala Glu Leu Pro Pro Ser Ala Asn Thr Ser Val 
,165 170 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: *cDNA ' ' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATATGGCTGG ACGCAGCACA" AATTCCACCA ACTAAAGCAG GAGGCTCGGC GTGATGCAGA ' 60 

TACCCAGACA CCATTATTAT GCTCACAGAA GAGATTCTAT AGCAGGGGCT TAAATTCACT 12 0 

GGAATCCACC CTGACTTTTC CTGCCAGTAC TTCTACCATT TTTTGAAACT ACAATACTGG 180 

AACATCCAGG- AACTGGAGTT -ATTCTACGCT-AATGGATTGG^ AAAGAATGTT GGGAAAGGAC . - 2 4 0 

ATCTTAAATC TTTTCTAACT ATGCCCTAAA CTGCAGAACT CAAAGGAAAT ATAGTGCCAT 300 

TGTTAGTAGT CATTCTAGAT. GAATTGGGAG TATCTCTCCA GTTATTCCCA GATTCACTAG 3 60 

TGATCCTTAA AGTCTCTATT CAGGGAGAGG AAGACACTTT CCATCTCAGA GATAGACTCG 420 

TGTTACCTTG ATGGATATTG GATTTGTCTA AGTCTCTTCT AGAAAAAATA AATTCTAGAT 480 



TATTAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA^ AAAA 52-4 
(2) INFORMATION FOR SEQ ID- NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2171 'base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNSSS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 2 : ; 

CCCCGCTACC GGGTTGCGGC CGGAAGCCGG GCGCCGCGGC TCTGCTTCGC TCGGGGATCT 60 

GGCGACATGG CCAGAAAGGC TCTCAAGCTT GCTTCGTGGA CCAGCATGGC TCTTGCTGCC 120 

TCTGGCATCT ACTTCTACAG TAACAAGTAC TTGGACCCTA ATGACTTTGG CGCTGTCAGG 180 

gt6ggcagag CAGTTGCTAC GACGGCTGTC ATCAGTTACG ACTACCTCAC, TTCCCTGAAG 240 

AGTGTCCCTT ATGGCTCAGA GGAGTACTTG CAGCTGAGAT CTAAGGTGCA CCTTCGCTCT 3 00 

. ■ GCCAGGCGTC TCTGTGAGCT CTGCTGTGCC AACCGGGGCA CCTTCATCAA GGTGGGCCAG 3 60 

CACCTGGGGG CTCTGGACTA CCTGTTGCCA GAGGAGTACA CCAGCACGCT GAAGGTACTG 420 
• CACAGCCAGG CTCCACAGAG CAGCATGCAA GAGATCCGCC AGGTCATCCG AGAAGATCTG- : 4 80 

GGCAAGGAGG TGCTCGTTCT GGCTGTGAAG CAGCTGTTCC CAGAGTTTGA GTTTATGTGG 540 

CTTGTGGATG AAGCCAAGAA GAACCTGCCT TTGGAGCTGG ATTTCCTCAA TGAAGGGAGG 600 

AATGCTGAGA 'AGGTGTCCCA GATGCTCAGG CATTTTGACT TCTTGAAGGT CCCCCGAATC 660 

CACTGGGACC TGTCCACGGA^ GCGGGTCCTC CTGATGGAGT TTGTGGATGG CGGGCAGGTC 720 

AATGACAGAG ACTACATGGA GAGGAACAAG ATCGAGGTCA ATGAGGTGAG GTCAAGAGCT 7 80 

CAGGGCTGCT GTGCCGGGGA ACGTGGGCTT GGTCAAGGCT GCCCAGGAAG TGCCTGTGTG 840 

tccagatctc acgccacctg ggcaagatc gatcttcgtc aatggcttcg 900 - 

tgcactgcga tccccacccc ggcaatgtac tggtgcggaa gcaccccggc acgggaaagg 960 

CGGAGATTGT CCTGTTGGAC CATGGGCTTT ACCAGATGCT CACGGAAGAA TTCCGCCTGA 1020 

ATTACTGCCA CCTCTGGCAG TCTCTGATCT GGACTGACAG GAAGAGAGTG AAGGAGTACA 1080 

GCCAGCGACT_GGGAGCCGG.G_GAT<CTCTACC__^^ 

CGTGGGACTC GGTCAACAGA GGCATCAGCC AAGCTCCCGT CACTGCCACT GAGGACTTAG ■ 1200 

AGATTCGCAA CAACGCGGCC AACTACCTCC CCCAGATCAG CCATCTCCTC AACCACGTGC 1260 
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CGCGCCAGAT GCTGCTCATC TTGAAGACCA ACGACCTGCT GCGTGGCATT GAGGCCGCCC 13 20 

TGGGCACCCG CGCCAGCGCC AGCTCCTTTC TCAACATGTC ACGTTGCTGC ATCAGAGCGC 13 80 

r 

TAGCTGAGCA CAAGAAGAAG AATACCTGTT CATTCTTCAG AAGGACCCAG ATCTCTTTCA 1440 

GCGAGGCCTT CAACTTATGG CAGATCAACC TCCATGAGCT CATCCTGCGT GTGAAGGGGT 15 00 

TGAAGCTGGC TGACCGGGTC TTGGCCCTAA TATGCTGGCT GTTCCCTGCT CCACTCTGAG 15 60 

TGGAATTGCT CTCCCTGCCC CATTCTGGTG -TCTTTCCACT CCTCAGCCCC TCATCTTGCC 162 0 

TCCACCCAGC TGCTCCATTT TTGCCACATC GTGGCCCGCA GCCCCAGAGT ' CACTGTCCAT 16 8 0 

GTCACCATCC TCCTCCTCCT TTGGAATCCT cfcCGCACAC TGTGGCCGTT GTCTCAGGGC 17 40 

"CCACAAGCTG AACTGTGGCA TAGCTCTCTC TTCTTGTCCA AGAAGACTCA GCAGCCTACA ■ 18 00" 

TTCCCATTCC .TGGTATGTGC CATTGGGTTG GATGTCCCCA CTACTTCCGT TAACGCTTCC -18 60 

CATTGTCAAG ATGTGCCACG GGTGCCACTG GGGGCACACT GAACTTGTAG GGAGTGTGAT 1920 

TTTGTTGGAG GTGCACATGG TCTCTGAATT TGACAGAGAA CACCTTCCCT TTCCTTGCCA 19 80 

TGTCACCCTC CAGAGGAAGT CACACCTCAG CGAGGTGGTT TGGCATCTGG GGCCAACTCC : 2040 

ATTACAGCTA TGAGCTCACT GCTGTCAGTG ACGTTTGGTG TTTTCTGTAC TGTGTTTCAA 2100 

TAAAAACTCC TTCAAGGTTG CAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2160 

xAAAAAAAAAA a 2171 
(2)" INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : ^ ^ 

(D) TOPOLOGY: linear 



( ii ) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Ala Arg Lys Ala Leu Lys Leu Ala Ser- Trp Thr Ser Met Ala Leu 
15 10 15 



Ala. Ala Ser Gly lie Tyr Phe Tyr Ser Asn Lys Tyr Leu Asp Pro Asn 
20 25 ■ 30 . 

Asp Phe Gly Ala Val Arg Val Gly Arg Ala Val Ala Thr Thr Ala Val 

35 40 . 45 . 



79 



wo 98/42741 



PCT/US98/05972 



lie Ser Tyr Asp Tyr Leu Thr Ser Leu Lys Ser Val Pro Tyr Gly Ser 
50 55 60 

Giu Glu Tyr Leu Gin Leu Arg Ser Lys Val His Leu Arg Ser Ala Arg 
65 ' 70 - 75 80 - 

Arg Leu Cys Glu Leu Cys Cys Ala Asn Arg Gly Thr ?he lie Lys Val 
85 90 , 95 ^ 

Gly Gin His Leu Gly Ala Leu Asp Tyr Leu Leu Pro Glu Glu Tyr Thr 
100 105 110 

Ser Thr Leu Lys Val Leu His Ser Gin Ala Pro Gin Ser Ser Met Gin 
115 ■ ' 120 125 

Glu lie Arg Gin Val lie Arg Glu Asp Leu Gly Lys Glu Val Leu Val 
130 135 140 

Leu Ala Val Lys Gin Leu Phe Pro Glu Phe Glu Phe Met Trp . Leu Val 
145 150 155 160 ■ 

Asp Glu Ala Lys Lys Asn Leu Pro Leu Glu Leu Asp Phe" Leu Asn Glu 
165 170 175 

Gly Arg Asn Ala Glu Lys Val Ser Gin Met Leu Arg His Phe Asp Phe - 
180 . 185 190 

Leu Lys Varl Pro Arg lie Hrs Trp Asp Leu Ser Thr Glu Arg -Val Leu * 
195 ' 200 205 

Leu Met Glu Phe Val' Asp Gly Gly Gin Val Asn Asp Arg Asp Tyr Met 
210 215 220 

Glu Arg Asn Lys lie Asp Val Asn Glu Val Arg Ser Arg Ala Gin Gly . 
225 .230 235 240 

Cys Cys Ala Gly Glu Arg Gly Leu Gly Gin Gly Cys Pro Gly Ser Ala . 

245 250 255 

Cys Val Ser Arg Ser His Ala Thr Trp Ala Arg Cys lie Val Arg 
. 260 265 270 



(2) INFORMATION FOR SEQ ID NO:14: ' 

(i) SEQUENCE CHARACTERISTICS: 

■ (A) LENGTH: 1613 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) " SEQUENCE DESCRIPTION: SEQ ID NO:14: 
CATGGCGGCT CCCTTGGTCC TGGTGCTGGT GGTGGCTGTG ACAGTGCGGG CGGCCTTGTT 60 

CCGCTCCAGT CTGGCCGAGT TCATTTCCGA GCGGGTGGAG GTGGTGTCCC CACTGAGCTC 120 

TTGGAAGAGA GTGGTTGAAG GCCTTTCACT GTTGGACTTG GGAGTATCTC CGTATTCTGG 180 

AGCAGTATTT CATGAAACTC CATTAATAAT ATACCTCTTT CATTTCCTAA TTGACTATGC* 240 

TGAATTGGTG TTTATGATAA CTGATGCACT GACTGCTATT GCCCTGTATT TTGCAATCCA 3 00 

GGACTTCAAT AAAGTTGTGT TTAAAAAGCA GAAAGTCCTC CTAGAACTGG AACAGTATGC -3 60. 

CCCAGATGTG GCCGAACTCA TCCGGACCCC TATGGAAATG CGTTACATCC CTTTGAAAGT 42 0 

GGCCCTGTTC TATCTCTTAA ATCCTTACAC GATTTTGTCT TGTGTTGCCA AGTCTACCTG 480' 

TGCCATCAAC AACACCCTCA TTGCTTTCTT' CATTTTGACT ACGATAAAAG TTTGATTATC 540 

TGTAAAATGG GGACAGTAAT TGTACCCACT TCATGGAATT ATTGAGAAGA CTAAATGGCT 600 
TAAGGCAGTG CTTTCCTCAG TGCTATTTTT CTTGCCTTAG CGACATACCA GTCTCTGAAC 660 

CCACTCACCT TGTTTGTCCC AGGACTCCTC TATCTCCTCC AGCGGCAGTA CATACCTGTG 72 0 

AAAATGAAGA GCAAAGCCTT CTGGATCTTT TCTTGGGAGT ATGCCATGAT GTATGTGGGA 780 
AGCCTAGTGG TAATCATTTG CCTCTCCTTC - TTCCTTCTCA GCTCTTGGGA TTTCATCCCC 840 
GCAGTCTATG GCTTTATACT TTCTGTTCCA GATCTCACTC CAAACATTGG TCTTTTCTGG 900 
TACTTCTTGG CAGAGATGTT TGAGCACTTC AiGCCTCTTCT TTGTATGTGT GTTTCAGATC 960 

AACGTCTTCT TCTACACCAT CCCGTTAGCC ATAAAGCTAA. ATCCTGAGAA ACATCTTTGT -1020 

CCTCACCTGC ATCATCATCG TCTGTTCCCT. GCTCTTCCCT GTCCTGTGGC ACCTCTGGAT 1080 

TTATGCAGGA AGTGCCAACT CTAATTTCTT TTATGCCATC ACACTGACCT TCAACGTTGG 1140 

GCAGATCCTG CTCATCTCTG ATTACTTCTA TGCCTTCCTG CGGCGGGAGT ACTACCTCAC 1200 

ACATGGCCTC TACTTGACCG CCAAGGATGG CACAGAGGCC ATGCTCGTGC TCAAGTAGGC 1260 

CTGGCTGGCA CAGGGCTGCA TGGAGCTCAG GGGGCTGTGG GGCCAGAAGY TGGGCCAAGC 1320 

CCTCCAGCCA GAGTTGCCAG CAGGCGAGTG CTTGGGCAGA AGAGGTTCGA GTCCAGGGTC 13 80 

ACAAGTCTCT GGTACCAAAA GGGACCCATG GCTGACTGAC AGCAAGGCCT" ATGGGGAAGA 1440 



ACTGGGAGYT CCCCAACTTG GACCCCCACC TTGTGGCTCT GCACACCAAG GAGCCCCYTC 1500 

CCAGACAGGA AGGAGAAGAG GCAGGTGAGC AGGGCTTGTT AGATTGTGGC TACTTAATAA ■ 1560 

ATGTTTTTTG TTATGAAGTC* TAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 1613 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
( G ) STRANDEDNE S S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : , protein 



(xi) SEQUENCE DESCRIPTION: SEQ . ID NO: 15: 

Met Ala Ala Pro Leu Val .Leu Val Leu Val Val Ala Val Thr Val Arg 

1 -5 10 * ■ - 15 , 

' Ala Ala Leu Phe Arg Ser Ser Leu Ala Glu Phe lie Ser Glu Arg Val 

20 ■ 25- 30 

Glu Val Val Ser Pro Leu Ser Ser .Trp Lys Arg Val " Val Glu Gly Leu 
35 , : 40 ' 45 

Ser Leu Leu Asp Leu Gly Val Ser Pro Tyr Ser Gly Ala Val , Phe His 

■ 50 55 " ' 60 

Glu Thr Pro Leu lie .Tie Tyr Leu Phe His - Phe Leu lie Asp" Tyr Ala 
65 70 .75- 80 

Glu Leu Val Phe Met lie Thr Asp Ala Leu Thr Ala lie Ala Leu Tyr 
85 . 90 95 

Phe Ala lie Gin Asp Phe Asn Lys Val Val Phe Lys Lys Gin Lys Leu 
100 105 110 . 

Leu Leu Glu Leu Glu Gin Tyr Ala Pro Asp Val Ala Glu Leu lie Arg 
115 120 125 

Thr Pro Met Glu Met Arg Tyr lie Pro Leu Lys Val Ala Leu Phe Tyr 
130 135 140 



Leu Leu Asn Pro Tyr Thr lie Leu Ser Cys Val Ala Lys Ser Thr Cys 
145 ' 150 155 160 

Ala lie Asn Asn Thr Leu lie Ala Phe Phe lie Leu Thr Thr lie Lys 
165 170. 175 

Val Ser Leu Ser Val Lys Trp Gly Gin 

180 185 



{2) INFORMATION FOR SEQ ID N0:16r. 

(iy SEQUENCE CHARACTERISTICS:^ 
(A) LENGTH: 372 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS :* double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ. ID N0:16: 

AAACCCTGTC GGTCTTGGAG CGACGACGGC AGAACCAGGG TCCCTGGCGG TGCGGCGGGG 6 0 

CCGGCGGGTG CAGCGGAAGC GGCGGCGGCG GCGGCAGTGA CGTCGCCGGG AACCCTAAGG 120 

ACTCTGCAAT ATGAATAATT CCCTAGAGAA CACCATCTCC^ TTTGAAGAGT ACATCCGAGT- 180 

AAAGGCACGG TGTGTCCCGC AACACAGGAT GAAGGAATTT CTGGACTCAC TGGCCTCTAA 24 0 

GGGGCCAGAA GCCCTTCAGG AGTTCCAGCA GACAGCCACC ACTACCATGG TGTACCAACA 3 00 

GGGTGGGAAC TGCATATACA CAGACAGCAC TGAAGTGGCT GGGTCTTTGC TTGAACTTGC 3 60 

CTGTCCAATC AC • 372 

(2) INFORMATION FOR SEQ ID NO: 17: - 

"(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 602 base pairs 

(B) TYPE: nucleic acid ; 

(C) STRANDEDNESS: double . . 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE :* cDNA , . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGGGAAGCTC GAAATGGAGA AGGTGAACCT TATGACCCAG ATGTGCTCTA CTATATTTTC 60 

CTGTGTATTC AAAAGTATCT TTTTGAAAAT GGAAGGGTAG ATGACATTTT CTCCGATCTT 120 

TATTATGTTC GGTTCACGGA GTGGCTACAT GAAGTTCTGA AGGATGTTCA GCCCCGGGTC 180 

ACTCCACTTG GCTATGTCTT GCCCAGCCAC GTGACTGAGG AGATGCTATG GGAGTGCAAG 240 

CAGCTTGGGG CTCACTCCCC CTCCACCTTG CTGACCACCC TCATGTTCTT TAATACCAAG 3 00 



TAAGTGTTCT AGAGGCTCCA .CTGCTGGCAT CTGTCCAGTG AAGAGTGTGG AAACTATCCA 360 
AGAGGCCTTC TGAATTCCTC TGACATATAT TTGAGAAACT GGGCTACTGA AAGCCCTAAC 420 
CCCACTTGGC TGCATTTTAT TTGGTAACCA GTGAGGCAAA CACCCTTGCC AGACCCCTAC 4 80 
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CATCCATCTT GATGTGGTTC CTGCACTGGA CACTGCTTGG GTACGGGCCT GCCCAGATCT 540 
TGGGAATGTG GGCAGTGGCT CCTCTGAAGC ACCAGTGGGC AGAGGATGAG TCATGGTATC 600 
CT - 602 

r 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Trp Phe Leu His Trp Thr Leu Leu Gly Tyr Gly Pro Ala Gin lie' 
1 ' 5" 10 ■ 15 ■ 

Leu Gly Met Trp ^Ala Val Ala Pro Leu Lys His Gin Trp Ala Glu- Asp 
20 25 30 

Glu Ser Trp Tyr Pro ' • , . . ' 

35 ' 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: ' . ■ 

" (A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE- TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TGGGAAAGGG CTTGGACTGT GAAAAGAAAT GTGGCCCCTT TCCATCTTCA AGAGAGATGG 60 
AATTAATGAT GGATGGACCC TGGAGGGAAT CTCCCCAGCC GACTTCCACT GGGCTGACAG 120 
"ACTTTGCTGA- CCAeAGGGGA-ACGATGTT(^T-T-TTGTTT(S-TT -GA-TGATGAGA -CATAAAC-TTA 180- 



GCATTTTAAT GGAAGAAAAA TGAGGGGAAC TTCAATTATG ATTTATTAAA GACAATTTCT 240 
ATTACACCCT CCTTTATGAC AAGTGACATT TTAGATGTAA AAGTAAAAAC TTTACCATGC -300 
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CTTTTTTTTT TTTGTTGGCC TAACATTGAG GCCTTAAAAC CTGAGGCTCC TGTGCCTGAT 



360 



GGAATTCTTG TAACATACAC TTGTGTATCA TATAAAGATA CCACTCTGTT TCTCTTATGT 



420 



ATTCTTACTC TAGTTGTTTA TTAAGAATGA CAAGCACGTC TTTTCAACAA AAAAAAAAAA 



480 



AAA 



483 



(2) INFORMATION FOR SEQ ID NO: 20: - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1853 base pairs 
.(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear ' • 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION-: SEQ ID NO:20: 

CAGATTCGCT GCTGGAGTGC TGGATGGAGC CTTTCTCTGC CCTCTGTGAC ATTTCCAATT 60 

TTAGATAATG CCTCACATCT CTGTCCCCCC GGGACCCCCT GGAGCCCCCA TGATCCCTAA 120 

GAAGACAGCT TGAACCTAGA TCTCACCCCC AGGATGTTGC GGAGGCTGCT GGAGCGGCCT . 180 

TGCACGCTGG CCCTGCTTGT GGGCTCCCAG CTGGCTGTCA TGATGTACCT GTCACTGGGG 240 

GGCTTCCGAA GTCTCAGTGC CCTATTTGGC CGAGATCAGG GACCGACATT TGACTATTCT 300 

CACCCTCGTG ATGTCTACAG TAACCTCAGT CACCTGCCTG GGGCCCCAGG GGGTCCTCCA 3 60 

GCTCCTCAAG GTCTGCCCTA CTGTCCAGAA CGATCTCCTC TCTTAGTGGG TCCTGTGTCG 420 

GTGTCCTTTA GCCCAGTGCC ATCACTGGCA GAGATTGTGG AGCGGAATCC CCGGGTAGAA 480 

CCAGGGGGCC GGTACCGCCC TGCAGGTTGT GAGCCCCGCT CCCGAACAGC CATCATTGTG 540 

CCTCATCGTG CCCGGGAGCA CCACCTGCGC CTGCI^ ACCACCTGCA~"CCCCTTCTTG '"^00 

CAGCGCCAGC AGCTTGCTTA TGGCATCTAT GTCATCCACC AGGCTGGAAA TGGAACATTT 660 

AACAGGGCAA AACTGTTGAA CGTTGGGGTG , CGAGAGGCCC TGCGTGATGA AGAGTGGGAC 720 

TGCCTGTTCT TGCACGATGT . GGACCTCTTG CCAGAAAATG ACCACAATCT GTATGTGTGT 780 

GACCCCCGGG^GACCCCGCCAJIGCTSCCGTT^ 840 

TACCCCCAGT ACTTCGGAAG AGTCTCAGCA CTTACTCCTG ACCAGTACCT GAAGATGAAT 900 

GGCTTCCCCA ATGAATACTG GGGCTGGGGT GGTGAGGATG ACGACTTGCT ACCAGGGTGC 960 
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GCCTGGCTGG GATGAAGATC TCTCGGCCCC CCACATCTGT AGGACACTAT AAGATGGTGA 102 0~ 

AGCACCGAGG AGATAAGGGC AATGAGGAAA ATCCCCACAG ATTTGACCTC CTGGTCCGTA 10 80 

CCCAGAATTC CTGGACGCAA GATGGGATGA ACTCACTGAC ATACCAGTTG CTGGCTCGAG 1140 

AGCTGGGGCC TCTTTATACC AACA.TCACAG CAGACATTGG GACTGACCCT ' CGGGGTCCTC 12 00 

GGGCTCCTTC TGGGCCACGT TACCCACCTG GTTCCTCCCA AGCCTTCCGT CAAGAGATGC 12 6 0 

TGCAACGCCG GCCCCCAGCC AGGCCTGGGC CTCTATCTAC TGCCAACCAC ' ACAGCCCTCC 1320 

GAGGTTCACA CTGACTCCTC CTTCCTGTCT ACCTTAATCA' TGAAACCGAA TTCATGGGGT 13 80 

TGTATTCTCC CCACCCTCAG CTCCTCACTG TTCTCAGAAG GATGTGAGGG AACTGAACTC 1440 
TGGTGCCGTG CTAGGGGGTA GGGGCCTCTC CCTCACTGCT GGACTGGAGC TGGGCTCCTG - 1500* 

TAGACCTGAG GGTCCNTCTY TGTAGGTCTC CTGTAGGGCT TAKGACTGTG AATCCTTGAT 15 6 0 
GTCATGATTT TATGTGACGA TTCCTAGGAG TCCCTGCCCC TAGAGTAGGA GCAGGGYTGG • 162 0 

ACCCCAAGCC CNTCCYTYTT CCATGGAGAG AAGAGTGATC TGGYTTCTCC TCGGACCTCT 1680 

GTGAATATTT ATTCTATTTA TGGTTCCCGG , GAAGTTGTTT GGTGAAGGAA GCCCCTCCCC 1740 

TGGGCATTTT CTGCCTATGC TGGAATAGCT CCCTCTTCTG.GTCCTGGCTC AGGGGGCTGG 1800 

GATTTTGATA TATTTTCTAA TAAAGGACTT TGTCTCGCAA AAAAAAAAAA AAA " • 18 53 
(2) INFORMATION FOR SEQ ID NO.: 21: . ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid. 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE, TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID .NO:21: 

Met Leu Arg Arg Leu Leu Glu Arg Pro Cys Thr Leu Ala Leu Leu Val 
1 5 10 15 

Gly S'er Gin Leu Ala Val Met Met Tyr Leu Ser Leu Gly Gly Phe Arg 
20 25 30 



Ser Leu Ser Ala Leu Phe Gly Arg Asp Gin Gly Pro Thr Phe Asp Tyr 
35 40 45 

Ser His Pro Arg Asp Val Tyr Ser Asn Leu Ser His Leu Pro Gly Ala 



86 



wo 98/42741 



PCT/US98/05972 



'50 55 60 

Pro Giy Gly Pro Pro Ala Pro Gin Gly Leu Pro Tyr Cys Pro Glu Arg 
65 70' 75 80 

Ser Pro Leu Leu Val Gly Pro Val Ser Val Ser Phe Ser Pro Val Pro 
85 90 95 

' Ser Leu Ala Glu lie Val Glu Arg Asn Pro Arg Val Glu Pro Gly Gly 
100 105 110 

Arg Tyr Arg Pro Ala Gly Cys Glu Pro Arg Ser Arg Thr Ala lie lie 
115 - ' 120 . . 125 

Val Pro His Arg Ala Arg Glu His His Leu Arg Leu Leu Leu Tyr His 
130 " 135 140 

. Leu His .Pro Phe Leu Gin Arg Gin Gln^Leu Ala Tyr Gly lie Tyr Val 
145 . .150 155 * 160 

ile His Gin Ala Gly Asn Gly Thr Phe Asn Arg Ala Lys Leu Leu Asn 
165 : 170 175 

Val Gly. Val Arg Glu Ala Leu. Arg Asp Glu . Glu Trp Asp Cys Leu Phe 
180 185 190 

Leu His Asp Val' Asp Leu Leu Pro Glu Asn Asp His Asn Leu Tyr Val 
195 - *. - _ 200 ' 205 

Cys Asp Pro Arg Gly Pro Arg His 'Val Ala Val Ala Met Asn Lys Phe 
210 215 220 

Gly Tyr Ser Leu Pro Tyr Pro Gin Tyr Phe Gly Arg Val Ser Ala Leu 
225 . 230 235 240 

Thr Pro Asp Gin Tyr Leu 'Lys Met Asn Gly Phe Pro Asn Glu Tyr Trp 
245 250 255 

Gly Trp Gly Gly Glu Asp Asp Asp Leu Leu Pro Gly Cys Ala Trp Leu 
260 . ' 265 270 



Gly 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS': 

(A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

^ (C) STRANDEDNESS: double 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AGATAAAGTA AGTGCTGTTT GGGCTAACAG GATCTCCTCT TGCAGTCTGC AGCCCAGGAC 60 

GCTGATTCCA GCAGCGCCTT ACCGCGCAGC CCGAAGATTC ACTATGGTGA AAATCGCCTT 120 ' 

CAATACCCCT ACCGCCGTGC AAAAGGAGGA GGCGCGGCAA GACGTGGAGG CCCTCCTGAG 180 

CCGCACGGTC AGAACTCAGA TACTGACCGG CAAGGAGCTC CGAGTTGCCA CCCAGGAAAA 240 

AGAGGGCTCC TCTGGGAGAT GTATGCTTAC TCTCTTAGGC CTTTCATTCA TCTTGGCAGG 3 00 

ACTTATTGTT -GGTGGAGCCT GCATTTACAA GTACTTCATG CCCAAGAGCA CCATTTACCG 3 60 

TGGAGAGATG TKCTTTTTTG ATTCTGAGGA TCCTGCAAAT TCCCTTCGTG GAGGAGAGCC 420 

TAACTTCCTG GCTGTGACTG AGGAGGCTGA CATTCGTGAG GATGACAACA TTGCAATCAT . 480 
TGATGTGCCT GTCCCCAGTT TCTCTGATAG TGACCCTGCA GCAATTATTC ATGACTTTGA 5 40 

AAAGGGAATG ACTGCTTACC TGGACTTGTT GCTGGGGAAC TGCTATCTGA TGCCCCTCAA 600 - 

TACTTCTATT GTTATGCCTC CAAAAAATCT GGTAGAGYTC TTTGGCAAAC TGGCGAGTGG 660 

CAGATATCTG CYTCAAACTT ATGTGGTTCG AGAAGACCTA GTTGCTGTGG AGGAAATTCG 720 

.TGA-TGTTAGT AAGGTTGGCA TCTTTATTTA CCAACTTTGC AATAACAGAA AGTCCTTCCG 7 80 
CCTTCGTCGC AGAGACCTCT TGCTGGGTTT CAACAAACGT GCCATTGATA AATGCTGGAA 840 
GATTAGAGAC TTCCCCAACG AATTTATTGT TGAGACCAAG ATCTGTCAAG AGTAAGAGGC . 900 
AACAGATAGA GTGTCCTTGG TAATAAGAAG TCAGAGATTT ACAATATGAC TTTAACATTA 960 

AGGTTTATGG GATACTCAAG ATATTTACTC ATGCATTTAC TCTATTGCTT ATGCTTTAAA 1020 

AAAAGGAAAA GAAAAAAACT ACTAACCACT GCAAGCTCTT GTCAAATTTT- AGTTTAATTG . 1080 

GCATTGCTTG TTTTTTGAAA CTGAAATTAC ATGAGTTTCA TTTTTTCTTT GAATTTATAG 1140 . 

GGTTTAGATT TCTGAAAGCA .'GCATCSAATAT ATCACCTAAC ATCCTGACAA "TAAATTCCAT r200'-- 

CCGTTGTTTT TTTTGTTTGT TTGTTTTTTC TTTTCCTTTA AGTAAGCTCT TTATTCATCT 12 60 

TATGGTGCAG CAATTTTAAA ATTTGAAATA TTTTAAATTG TTTTTGAACT TTTTGTGTAA 132 0 

AATATATCAG ATCTCAACAT TGTTGGTTTC TTTTGTTTTT CATTTTGTAC AACTTTCTTG 13 80' 

^ . AATTTAGAAA TT ACATCTTT GCAGTTCTGT TAGGTGCTC T GTAATTAAC^JTGACTTATAT^ _ 1440 

GTGAACAATT TTCATGAGAC AGTCATTTTT AACTAATGCA GTGATTCTTT CTCACTACTA 1500 

TCTGTATTGT GGAATGCACA AAATTGTGTA GGTGCTGAAT GCTGTAAGGA GTTTAGGTTG 1560 
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TATGAATTCT ACAACCCTAT AATAAATTTT ACTCTATAAA AAAAAAAAAA AAAAAJ\AAAA 1520* 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A^AAAAAAAAA 1680 

AAAAAA 1686 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 63 amino acids 

(B) TYPE: amino acid - 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Met Val Lys lie Ala Phe Asn Thr Pro Thr Ala Val Gin Lys Glu Glu 

1 . . ■ 5 ' 10 .15 

Ala Arg Gin Asp Val Glu Ala Leu Leu Ser Airg Thr Val Arg Thr Gin 
20 25 30, 

lie Leu T-hr Gly Lys Glu Leu , Arg Val Ala Thr Gin Glu Lys Glu. Gly, 
35 40 , 45 ■ 

Ser Ser Gly Arg Cys Met Leu Thr Leu Leu Gly Leu Ser Phe lie Leu 
50 55 60 

Ala Gly Leu lie Val Gly Gly Ala Cys lie Tyr Lys Tyr Phe Met Pro . 

65 ■ 70 75 .80 

Lys Ser Thr ILe Tyr Arg Gly Glu Met X'aa" Phe Phe .Asp Ser Glu Asp 
85 90 : 95- . 

Pro Ala Asn Ser Leu Arg Gly Gly Glu Pro Asn Phe Leu Pro Val Thr 
100 105 110 _ 



Glu Glu Ala Asp lie Arg Glu Asp Asp Asn lie Ala lie lie Asp Val 
115 120 ^ 125 

Pro Val Pro Ser Phe Ser Asp Ser Asp Pro . Ala Ala lie lie His Asp 
130 " 135 140 

Phe Glu Lys Gly Met Thr Ala Tyr Leu Asp Leu Leu Leu Gly' Asn Cys 
145 15 0 155 160 

Tyr Leu Met Pro Leu Asn Thr Ser lie Val Met Pro Pro Lys Asn: Leu 
165: . 170. 175 " 

Val Glu Xaa Phe Gly Lys Leu^ Ala Ser Gly Arg Tyr Leu Xaa Gin Thr 
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180 185 190 

Tyr Val Val Arg Glu Asp Leu Val Ala Val Glu Glu lie Arg Asp Val 
195 200 205 

Ser Asn Leu Gly lie Phe lie Tyr Gin Leu Cys Asn Asn Arg Lys Ser 
210 ' 215 220 

Phe Arg Leu Arg Arg Arg Asp Leu Leu Leu Gly Phe Asn Lys Arg Ala 
225 230 235 240 

lie Asp Lys Cys Trp Lys lie Arg His Phe Pro -Asn Glu Phe lie Val 
245 250 255 

Glu Thr Lys lie Cys Gin Glu 
260 

(2) INFORMATION FOR SEQ ID NO : 2 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

■' (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc= ^oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TNCACATTCTC AGTGGGAACT TGATGAAC . ■ 2 9 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE- CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

_ (C) STRANDEDNESS: single 

(dT topology'^ ~i~inear ^ 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



( xi.) -SEQUENCE _DESCRIRTX0N.:._SEQ_ID_N0 ::2 5_: 



ANATATAGGTG GAATGAATTC TATCCTTG 29 
( 2 INFORMATION FOR SEQ ID NO : 2 6 : 
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(i) SEQUENCE CtlARACTERISTICS : 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) molecule' TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = ■ " oligonucleot ide " 



(xi) , SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GNTATAGTAAT AATAGCACAA • AGGACGGG 29 

( 2 ) INFORMATION " FOR SEQ. ID NO : 27 : ' 

(i) SEQUENCE CHARACTERISTICS: - ' 
.(A) LENGTH: 29 base pairs 
(B) TYPE:' nucleic acid 
^ (C) STRANDEDNESS: single 
(D) ' TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A). DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:. 
TNGCCAGGAAA CCAAATCAAT GACTCTTT 29 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other, nucleic acid 

(A) DESCRIPTION: /desc -= "oligonucleotide* 



(xi) SEQUENCE DESCRIPTION: SEQ ip,NO:28: 



TMI^TTGACG~1^TG : T 29" 

(:2.). INFORMATION FOR SEQ ID NO; 29: 

(i) SEQUENCE CHARACTERISTICS: • 
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(A) LENGTH: 2 9 base pairs 

(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single' 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
ANAAATGGAGC AGCTGGGTGG AGGCAAGA 
(2) INFORMATION FOR SEQ ID NO : 3 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid - 

(C) STRANDEDNESS: single 
■(D) TO POLOG Y : 1 inear 

(ii) MOLECULE TYPE:' other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:.3 0: 
TNCGGAGATAC TCCCAAGTCC AACAGTGA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE ^TYPE 7' 6 th^ 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 
GNTTAGG GCTT TC A GTAGCCC AGTTTCTC i. 



(2) INFORMATION. FOR SEQ ID- NO: 32: 

(i)^ SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
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(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ANTGACAGGTA CATCATGACA GCCAGCTG 2 9 

(2) INFORMATION FOR SEQ ID NO : 3 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE:, nucleic acid 

(C) STRANDEDNESS: single ■ ' ■ . , 

(D) TOPOLOGY: linear 

(ii) MOLECULE' TYPE: other nucleic acid- ' ^ 
(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CNGGATGTTAG GTGATATATT CATGCTGC 2 9 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 4 : 

Gly Glu Val Lys Ser S.er Gly Leu Leu Cys Arg Gly Thr Trp Ala Trp 
.1 5 10 ■■ 15 



Ser Arg Leu Pro Arg; hys, Cys Leu Cys Val Gin lie Ser Arg. His Leu 
20 25 . 30 

Gly Lys -Met Tyr Ser Glu- Met lie Phe Val Asn Gly Phe Val Kis Cys 
3 5 40 45 
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Asp Pro His Pro Gly Asn Val Leu Val Arg Lys His Pro Gly Thr Gly 
50 55 60 

Lys Ala Glu lie Val Leu Leu Asp His Gly Leu Tyr Gin Met Leu Thr 
65 70' 75 80 

Glu Glu Phe Arg. Leu Asn Tyr Cys His Leu Trp^ Gin- Ser Leu lie Trp 
85 90 ^ 95 

Thr Asp Arg Lys Arg Val Lys. Glu Tyr Ser Gin Arg Leu Gly Ala Gly 
100 105 110 

Asp Leu Tyr Pro Leu Phe Ala Cys Met Leu Thr Ala Arg Ser Trp Asp 
115 120 ' 125 

Ser Val Asn Arg Gly lie Ser Gin Ala Pro Val Thr Ala Thr Glu Asp 
130 135 140 

• Leu Glu lie Arg Asn Asn 'Ala Ala Asn Tyr Leu Pro Gin lie Ser. His 
145 150 " 155 160 

Leu Leu Asn His Val Pro Arg Gin Met Leu Leu lie Leu Lys Thr Asn 
165 170 . 175 

Asp Leu Leu Arg Gly .He Glu Ala Ala Leu Gly. Thr Arg Ala Ser Ala 
180 - 185 190 

Ser Ser Phe Leu Asn Met Ser Arg Cys Cys lie Arg Ala Leu Ala Glu' 
195 200 205 . 

His Lys Lys Lys Asn Thr Cys Ser * Phe Phe Arg' Arg Thr Gin He Ser 
210 215 226 

Phe Ser Glu A;la Phe Asn Leu Trp Gin lie Asn Leu His Glu Leu He 
225 230 235 240 

Leu Arg Val Lys Gly Leu Lys Leu Ala Asp Arg Val Leu Ala Leu He 
245 250 255 

-Cys Trp Leu Phe Pro Ala Pro Leu 

' . 2 60 : „ . _ : - : 
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What is claimed is: 

1. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide, sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 54 to nucleotide 737; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 188 to nucleotide 671; 

(d) . a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone bfl71_6 deposited under iaccession 
number ATCC 98371; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone bfl71_6 deposited under accession number ATCC 98371; 

(f) ^ a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of done bfl71_6 deposited under accession number 
ATCC 98371; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone bf 17116 deposited under accession number ATCC 98371; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 109 to amino acid 118 of 
SEQIDNO:2; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
W-rg) above; " ■ ^ '■ 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 



. 2. The polynucleotide of claim 1 wherein said polynucleotide is operably 
linked: to at least one expression control sequence. 
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A host cell transformed with the polynucleotide of claim 2. 

4. The host cell of claim 3, wherein said cell is a mammalian cell. 

5. A process for producing a protein encoded by the polynucleotide of claim 
2, which process comprises: 

(a) growing a culture of the host cell of claim 3 in a suitable culture 
medium; and 

(b) purifying said protein from the culture. 

6. A protein produced according to the process of claim 5. 

7. The protein of claim 6 comprising a mature protein. 

8. A protein comprising an amino acid sequence selected from the group 
consisting of : . 

(a) _ the amino acid sequence of SEQ ID„NO:2; 

(b) the amino acid sequence of SEQ ID NO:2 from amino acid 46 to 
amino acid 206; 

(c) fragments of the amino acid sequence of SEQ ID NO:2 comprising 
the amino acid sequence from amino acid 109 to amino acid 118 of SEQ ID NO:2; 
and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
bfl71_6 deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. 



9. The protein of claim 8, wherein said protein comprises the amino acid 
sequence of SEQ ID NO:2. 

10. The protein of claim 8, wherein said protein comprises the amino acid 
sequence of SEQ-ID-NO:2 from amino acid-46-to amino aGid-206 ^ 

11. A composition comprising the protein of claim. 8 and a pharmaceutically 
acceptable carrier. 
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12. A method for preventing, treating or ameliorating a medical condition 
which comprises administering to a mammalian subject a therapeutically effective amount' 
of a composition of claim 11. • ' 

13. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:l. 

14. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
. NO:3 from nucleotide 135 to nucleotide 1169; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 1 to nucleotide 875; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone ckl81_7 deposited under accession 
number ATCC 98371; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ckl81_7 deposited under accession number ATCC 98371; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone ckl81_7 deposited under accession number 
ATCC 98371; 

(g) . a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone ckl81_7 deposited under accession number ATCC 98371; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:4; 

(i) apolynuclebtide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:4 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 167 to amino acid 176 of 
SEQIDNO:4; ' 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 

(a)-(g)-above; — ^ — 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above and 
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(I) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

15. A protein comprising an amino acid sequence selected from the group 
consisting of : . 

(a) the amino acid sequence of SEQ ID NO:4; 

(b) the amino acid sequence of SEQ ID NO:4 from amino acid 1 to 
amino acid 247; 

(c) . fragments of the amino acid sequence of SEQ ID NO:4 comprising 
the amino acid sequence from amino acid 167 to amino add 176 of SEQ ID NO:4; 
and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
ckl81_7 deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. 

. 16. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:3. 
17. An isolated polynucleotide selected from the group, consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 882 to nucleotide 1106; 

(c) - a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 1050 to nucleotide 1106; 

"(d) polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5 from nucleotide 1028 to nucleotide 1395; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
. length protein coding sequence of clone co736_3 deposited under accession 

number ATCC 98371; 

(f) a-polynucleGtide-encoding'the-full-length-protein-encoded by the 

cDNA insert of clone co736_3 deposited under accession number ATCC 98371; 
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(g) a polynucleotide comprising the nucleotide sequence of a mature' 
protein coding sequence of clone co736_3 deposited under accession number 
ATCC 98371; 

(h) . a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone co736_3 deposited under accession number ATCC 98371; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:6; 

(j) a polynucleotide encoding a protein comprising a fragment of the' . 
amino acid sequence of SEQ ID NO:6 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 32 to amino acid 41 of SEQ 
IDNO:6; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) - a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

18. A protein comprising an amino acid sequence selected from the group 
consisting of : . 

i 

(a) the amino acid sequence of SEQ ID NO:6; 

(b) fragments of the amino acid sequence of SEQ ID NO:6 comprising 
the amino acid sequence from amino acid 32 to amino acid 41 of SEQ ID NO:6; and 

(c) . the amino acid sequence encoded by the cDNA insert of clone 
co736_3 deposited under accession number ATCC 98371; 

the protein being substantially free from otherlnammaliaii proteins. 

19. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:5. 

20. An isolated polynucleotide selected from the group consisting of: 

• (a)„_^ ^a„p_cdynud.eo,tide_comprisihg-the-nudeotide-sequence-of-SEQ-ro 

NO:7; 

(b) a polynucleotide comprising the nucleotide sequence of SEQTD' 
NO:7 from nucleotide 2283 to nucleotide 2858; 
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(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 1164 to nucleotide 1433; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone dm26_2 deposited under accession 
number ATCC 98371; 

(e) a polynucleotide encoding the full-length protein encoded by the 
* cDNA insert of clone dm26_2 deposited under accession number ATCC 98371; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone dm26_2 deposited under accession number 
ATCC 98371; 

.(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone dm26_2 deposited under accession number ATCC 98371; 

(h) a polynucleotide encoding a protein comprising- the amino acid 
sequence of SEQ ID NO:8; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:8 having biological activity, the fragment 
comprising, the amino acid sequence from am'ino acid 91 to amino acid 100 of SEQ 
IDNO:8; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) . a polynucleotide capable of hybridizing imder stringent conditions 
. to any one of. the polynucleotides specified in (a)-(i). 

_ _ _ 21. _ _ A -protein comprising-an aniino acid sequence selected from the'gr^ 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:8; 

(b) fragments of the amino acid sequence of SEQ ID NO:8 comprising 
the amino. acid sequence from amino acid 91 to amino acid 100 of SEQ ID NO:8; 
and : 

(c) ' the amino acid sequence encoded by the cDNA insert of clone 
dm26_2 deposited under accession number ATCC 98371; 

^tiie protein being substantially free from other mammalian proteii\s. 
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22. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:7. 

23. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:9; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NQ:9 from nucleotide 168 to nucleotide 683; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 318 to nucleotide 683; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone eq229_3 deposited under accession 
number ATCC 98371; 

(e) . , a polynucleotide encoding the full-length protein encoded by the 
cDN A insert of clone eq229_3 deposited under accession number ATCC 98371; 

^ (f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone eq229_3 <ieposited under accession number 
ATCC 98371; " 

(g) 3 polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone eq229_3 deposited under accession number ATCC 98371; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:10; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:10 having biological activity/ the fragment 
comprising the amino acid sequence from amino acid 81 to amino acid 90 of SEQ 

_ ID NO:10; - ~ - " " ' 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a spedes homologue of the protein 
of (h) or (i) above ; and 

G) a pol ynucleotide ca pablejof hybridizing-under-^stringent Gonditions- 

to any one of the polynucleotides specified in (a)-(i). 
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24. A protein comprising an amino acid sequence selected from the, group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO: 10; 

(b) the amino acid sequence of SEQ ID NO: 10 from amino acid 53 to 
amino acid 172; 

(c) fragments of the amino acid sequence of SEQ ID NO:10 comprising 
the amino acid sequence from amino acid 81 to amino acid 90 of SEQ ID NO:10; 
and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
eq229_3 deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. 

25. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:9 
andSEQIDNO:ll, - 

26. An isolated polynucleotide ^ selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:12; ' ^ 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:12 from nucleotide 67 to nucleotide 879; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:12 from nucleotide 118 to nucleotide 879; 

(d) • a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:12 from nucleotide 1224 to nucleotide 2171; 

- - - - - (e) - - a polynucleotide comprising the nucleotide sequence of the fuU- 
. length protein coding sequence of clone fh3_6 deposited xxnder accession number 
ATCC 98371; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone fh3_6 deposited under accession number ATCC 98371; 

(g) a^pplynucleatide_comprising~the nuc4eotide-sequence-of-a-ma^^ — 

protein coding sequence of clone fh3_6 deposited' imder accession number ATCC 
98371; - . 
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(h) a polynucleotide encoding a matxire protein encoded by the cDNA' 
insert of clone fh3_6 deposited under accession number ATCC 98371; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:13; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO: 13 having biological activity, the fragment 
- comprising the amino acid sequence from amino acid 130 to amino acid 139 of 
SEQIDNO:13; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species honiologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

27. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO: 13; 

(b) the amino acid sequence of SEQ ID NO:13 from amino acid 1 to 
amino acid 119; 

(c) fragments of the amino acid sequence of SEQ ID NO:13 comprising 
the amino acid sequence from amino acid 130 to amino acid 139 of SEQ ID NO:13; 
and 

(d) . the amino add sequence encoded by the cDN A insert of clone fh3_6 
deposited under accession number ATCC 98371; 

- - -the protein-being substantially free from other mammalian proteins. 

28. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:12. 

29. An isolated polynucleotide selected from^ the group consisting of: 



(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:14; 
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(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 14 from nucleotide 2 to nucleotide 556; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:14 from nucleotide 53 to nucleotide 556; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:14 from nucleotide 1 to nucleotide 367; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone fs87_3 deposited under accession number 
ATCC 98371; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone fs87_3 deposited under accession number ATCC 98371; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone fs87_3 deposited under accession number ATCC 
98371; ' ■ 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone fs87_3 deposited under accession number ATCC 98371; 

(i) -_ a polynucleotide encoding a protein comprising the amino' acid 
sequence of SEQ ID NO:15; 

(j) , a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:15 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 87 to amino acid 96 of SEQ 
IDNO:15; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
" of (i) or (j) above ;'anci ' . 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

30; A protein comprising an amino acid sequence selected from the group 

^consisting-of:— . — . — ~ " 

(a) the amino acid sequence of SEQ ID NO: 15; 
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(b) fragments of the amino add sequence of SEQ ID NO:15 comprising 
the amino acid sequence from amino acid 87 to amino acid 96 of SEQ ID NO;15; 
and 

(c) the amino acid sequence encoded by the cDNA insert of done 
fsS7_3 deposited under accession number ATCC 98371; 

,the protein being substantially free from other mammalian proteins. 

31. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:14. 

32. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:17; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:17 from nucleotide 492 to nucleotide 602; . 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone fy530_2 deposited under accession 
number ATCC 98371; ; - _ " ' " . " 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone fy530_2 deposited under accession number ATCC 98371; 

(e) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone fy530_2 deposited imder accession number 
ATCC 98371; 

(f) a polynudeotide encoding a mature protein encoded by the cDNA 
insert of clone fy530_2 deposited under accession number ATCC 98371; 

(g) a polynucleotide encoding a protein comprising the amino add 
. ~ sequence of SEQ ID NO: 18; . - ^ " 

(h) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:18 having biological activity, the fragment 
comprising the amino add sequence from amino acid 13 to amino acid 22 of SEQ 
ID NO:18; ' 

(i) a p.olynucleotide-which4s"an alleUe-variant of a polynucle^^^ 

(a)-(f) above; 

(j) a polynudeotide which encodes a spedes homologue of the protein 
of (g) or (h) above ; and 
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(k) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(h). 

33. A, protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:18; 

(b) fragments of the amino acid sequence of SEQ ID NO:18 comprising 
the- amino acid sequence from amino acid 13 to amino acid 22 of SEQ ID NO:18; 
and 

(c) the amino acid sequence encoded by the cDNA insert of clone 
fy530_2 deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. 

34. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:17, 
SEQ ID NO:16, and SEQ ID NO:19 . 

35. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:20; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:20 from nucleotide 154 to nucleotide 972; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:20 from nucleotide 1 to nucleotide 341; 

(d) - a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of done ge51_l deposited under accession number 

ATCC 9837iy " ' " . " T " 

(e) . a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ge51_l deposited under accession number ATCC 98371; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of done ge51_l deposited under accession number ATCC 

. 98371;-^ — 

(g) a polynucleotide encoding a mature protein encoded by the cDN A 
insert of clone ge51_l deposited under accession number ATCC 98371; 
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(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:21; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:21 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 131 to amino acid 140 of 
SEQ ID NO:21; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

36, A protein comprising an amino acid sequence selected from the group . 
consisting of: 

(a) the amino "acid sequence of SEQ ID NO:21; 

(bX the amLirio. acid sequence of SEQ ID NO:21 from amino acid 1 to 
arnino acid 62; 

(c) fragments of the amino acid sequence of SEQ ID NO:21 comprising 
the amino acid sequence from amino acid 131 to amino acid 140 of SEQ ID NO:21; 
and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
ge51_l deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. 

' ' \ ^ 37: " Ah isolated'gehe cdSespbnding to the cDNA sequence of SEQ ID NO:20. 

38. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:22; — 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:22 from nucleotide 104 to nucleotide 892; 
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(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:22 from nucleotide 299 to nucleotide 892; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:22 from nucleotide 798 to nucleotide 1261; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone gxl83_l deposited under accession 
number ATCC 98371; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone gxl83_l deposited under accession number ATCC 98371; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone gxl83_l deposited under accession number 
ATCC 98371; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone gxl83_l deposited under accession number ATCC 98371; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:23; ~ 

(j) a polynucleotide encoding a protein. comprising a fragment, of the 
amino acid sequence of SEQ ID NO:23 having biological activity, the fragment 
comprising the amino acid sequence from amino acid 126 to amino acid 135 of 
SEQIDNO:23; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

■ (1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing xmder stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

39. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:23; 

(b) — the amino acid sequence-of:SEQTD NO:23"from amino acid^53 to~ 

amino acid 89; 
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(c) fragments of the amino add sequence of SEQ ID NO:23 comprising 
the amino acid sequence from amino acid 126 to amino acid 135 of SEQ ID NO:23; 
and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
gxl83_l deposited under accession number ATCC 98371; 

the protein being substantially free from other mammalian proteins. 

40. An isolated gene corresporiding to the cDNA sequence of SEQ ID NO:22. 
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FIGURE lA 




BamHi 2772 C!a! 26S6 



Plasmid name: pED6dpc2 
Plasmid size: 5374 bp 



Xomments/References:-~^pED6dpc2Hs-derfved-from"pED6dpcl~by-insel1io^^ " 
polylinker to facilitate cDN A cloning. SST cDNAs are cloned between EcoRI and Notl. 
pED vectors are described in Kaufman et al.(1991), NAR 19: 4485-4490.' 
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FIGURE IB 




Sap! 213-5 



Plasmid name; pNOTs 
Plasinid size: 4529 bp 



Comm'ents/Hoferoncos: pNOTs is a derivative i pMT2 (Kaufman et al,1989. WJol:CeD.Biol.9: 1741 -1750). 
OHFR was deleted and a new polylinker was insorled between EcoRl and Hpal. M13 origins 
oi replication was Inserted In the Clal site. SST cDNAs are cloned between EcoRI and 
NotI 
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Polynucleotide comprising the nucleotide sequence of 
Seq.ID:3 and encoding a polypeptide of Seq.ID:4 or 
fragments. Polynucleotide fragments, variants, homologues, 
gene thereof. Protein comprising an amino- acid sequence, of . 
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